Dan Garrette (@dhgarrette) 's Twitter Profile
Dan Garrette

@dhgarrette

Research Scientist at Google. My research focuses on Natural Language Processing and Machine Learning.

ID: 75898512

linkhttp://www.dhgarrette.com calendar_today20-09-2009 23:06:27

42 Tweet

513 Followers

231 Following

Google AI (@googleai) 's Twitter Profile Photo

TyDi QA is a new multilingual dataset for information-seeking question answering featuring 11 Typologically Diverse languages and over 200k QA pairs. Learn more and start experimenting with the data and code ↓ goo.gle/39cZVkv

Jonathan Clark (@jonclarkseattle) 's Twitter Profile Photo

New from Google Research: CANINE, a pre-trained tokenization-free language encoder. This frees us from a variety of pitfalls associated with tokenization, but also improves quality on TyDi QA, a multilingual question answering benchmark. arxiv.org/abs/2103.06874

New from Google Research: CANINE, a pre-trained tokenization-free language encoder. This frees us from a variety of pitfalls associated with tokenization, but also improves quality on TyDi QA, a multilingual question answering benchmark. arxiv.org/abs/2103.06874
Stanford NLP Group (@stanfordnlp) 's Twitter Profile Photo

Just when subword models (BPE, SentencePiece, supported by the great Tokenizers library github.com/huggingface/to…) have become the new standard practice in #NLProc, signs emerge that their days may be numbered

Jason Wei (@_jasonwei) 's Twitter Profile Photo

New paper to appear in #emnlp2021! arxiv.org/abs/2109.07020 We study the syntactic abilities of BERT by manipulating the training corpus and retraining BERT.

Noah Constant (@noahconst) 's Twitter Profile Photo

Want your image generation model to stop misspelling everything? Try giving it access to character-level input features! arxiv.org/abs/2212.10562

Want your image generation model to stop misspelling everything? Try giving it access to character-level input features! arxiv.org/abs/2212.10562
Google AI (@googleai) 's Twitter Profile Photo

Presenting FRMT, a new dataset and evaluation benchmark for Few-Shot Region-Aware Machine Translation that seeks to drive research progress on equitably serving speakers of different language varieties. Learn more and see how current models fare → goo.gle/3IvnPwc

Dan Garrette (@dhgarrette) 's Twitter Profile Photo

Our low-resource POS-tagging NAACL (final) and ACL (draft) papers are now online along with source code: github.com/dhgarrette/low… #nlproc

Dan Garrette (@dhgarrette) 's Twitter Profile Photo

My #naacl2013 talk slides on Low-Resource POS-Tagging are now posted on cs.utexas.edu/~dhg/ (.key & .pdf). Paper, code, and data too.

Dan Garrette (@dhgarrette) 's Twitter Profile Photo

The video of my #naacl2013 talk on Learning a POS-Tagger from Two Hours of Annotation is now online: techtalks.tv/talks/learning…

Dan Garrette (@dhgarrette) 's Twitter Profile Photo

My #NAACL2013 presentation on weakly-supervised tagging was nominated for a *Best Talk* award! Watch it here: techtalks.tv/talks/learning… #NLProc

Dan Garrette (@dhgarrette) 's Twitter Profile Photo

Extremely excited to announce that I've accepted a postdoc position with Luke Zettlemoyer at the University of Washington starting in May!

Dan Garrette (@dhgarrette) 's Twitter Profile Photo

New paper! A "supertag-context" CCG parsing model: prefer constituent labels that fit contexts dhgarrette.com/papers/garrett…

New paper! A "supertag-context" CCG parsing model: prefer constituent labels that fit contexts dhgarrette.com/papers/garrett…