Thibault Sellam (@thiboibo)'s Twitter Profile
Thibault Sellam

@thiboibo

ID: 149132642

Link: http://sellam.me
Joined: 28-05-2010 14:05:36

45 Tweets

342 Followers

276 Following

Sebastian Gehrmann (@sebgehr):

Listing issues in NLG evaluations turned into a 25-page survey!

In “Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text”, Thibault Sellam, Elizabeth Clark, and I cover 250+ papers. 
📄Link: arxiv.org/abs/2202.06935 

Want to learn more?👇
Dipanjan Das (@dipanjand):

My colleagues Chris Alberti, Kuzman Ganchev and I are looking for a fall intern. The topic is improving language technologies for underrepresented languages by leveraging large pretrained language models. The position could be in NYC or remote. If interested, please reach out.

Kelvin Guu (@kelvin_guu):

New from Google Research! Language models perform amazing feats, but often still "hallucinate" unsupported content. Our model, RARR🐯, automatically researches & revises the output of any LM to fix hallucinations and provide citations for each sentence. arxiv.org/abs/2210.08726 🧵
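(A minimal sketch of what a research-and-revise loop of this flavor might look like; this is my reading of the abstract, not the authors' code. The search, support-check, and revise callables stand in for RARR's LLM-prompted stages and are assumptions.)

```python
from typing import Callable, List, Tuple

def research_and_revise(
    sentences: List[str],
    search: Callable[[str], str],              # research: retrieve evidence for a claim
    is_supported: Callable[[str, str], bool],  # agreement check: does the evidence back it?
    revise: Callable[[str, str], str],         # revision: minimally edit to match the evidence
) -> Tuple[str, List[str]]:
    """Check each sentence of an LM output against evidence and fix what isn't supported."""
    revised, citations = [], []
    for sentence in sentences:
        evidence = search(sentence)
        if not is_supported(sentence, evidence):
            sentence = revise(sentence, evidence)  # keep edits minimal
        revised.append(sentence)
        citations.append(evidence)                 # one citation per output sentence
    return " ".join(revised), citations
```

Working sentence by sentence with minimal edits is what makes it possible to attach a citation to each sentence without rewriting the whole output.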

Ran TIAN (@robin_tian):

Amos is a new optimizer we propose for pre-training large language models. It is more efficient and converges faster than AdamW: ≤ 51% of the memory for slot variables, and better validation loss within ≤ 70% of the training time!

New from Google Research. Preprint: arxiv.org/abs/2210.11693
Elizabeth Clark (@eaclark07):

We are excited to release Seahorse 🌊🐴, a ✨multilingual, multifaceted summarization evaluation dataset✨ with 96,000+ human ratings to enable faster progress in training and evaluating learnt metrics for summarization!

Preprint: arxiv.org/abs/2305.13194
Data: goo.gle/seahorse

Google AI (@googleai):

Learn how SQuId (Speech Quality Identification), a 600M-parameter regression model that estimates how natural a piece of speech sounds, can be used to complement human ratings in text-to-speech evaluation across many languages → goo.gle/3J1kghg

Sam Fraiberger 🔎🌍 (@spfraib):

🚨 Dream Job Alert 🚨 We are looking to fill various positions including NLP researcher, data engineer and software developer. If you are interested in #LLM, causal inference and having a positive impact on the world, please reach out! worldbank.org/en/research/di…

Thibault Sellam (@thiboibo):

The Seahorse dataset is available: 96K ratings to train and evaluate new summarization metrics. Congrats Elizabeth Clark and team!

Paper here: arxiv.org/abs/2305.13194
Models here: huggingface.co/collections/go…
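(A hedged sketch of scoring a summary with one of the released metric checkpoints via Hugging Face transformers. The checkpoint name "google/seahorse-large-q4" and the premise/hypothesis input format are assumptions on my part; check the model cards in the collection for the exact identifiers and usage.)

```python
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

ckpt = "google/seahorse-large-q4"  # assumed name; one checkpoint per quality dimension
tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = AutoModelForSeq2SeqLM.from_pretrained(ckpt)

article = "..."   # source document
summary = "..."   # system summary to score
inputs = tokenizer(f"premise: {article} hypothesis: {summary}", return_tensors="pt")

with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=1,
                         output_scores=True, return_dict_in_generate=True)

# The metric is framed as generating "1" (good) vs "0" (bad); the probability of the
# "1" token serves as a continuous score for this quality dimension.
one_id = tokenizer("1", add_special_tokens=False).input_ids[0]
score = torch.softmax(out.scores[0][0], dim=-1)[one_id].item()
print(f"estimated score: {score:.3f}")
```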

Pete Shaw (@ptshaw2):

Excited to share new work from Google DeepMind: “ProtEx: A Retrieval-Augmented Approach for Protein Function Prediction”

biorxiv.org/content/10.110…
Aran Komatsuzaki (@arankomatsuzaki):

Google presents Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

Long-context LM:
- Often rivals SotA retrieval and RAG systems
- But still struggles with areas like compositional reasoning

repo: github.com/google-deepmin…
abs: arxiv.org/abs/2406.13121
iseeaswell꩜bʂky (@iseeaswell):

The playlist starts in West Africa and wanders West-to-East until it hits Brazil. It’s composed of songs supplied by native speakers, which tend to be bangers. youtube.com/playlist?list=…

Jacob Austin (@jacobaustin132):

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n
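(A tiny example of the kind of back-of-envelope "systems math" the book is about; the numbers below are illustrative and not taken from the text.)

```python
# Illustrative numbers only.
params = 70e9          # 70B-parameter dense transformer
tokens = 2e12          # 2T training tokens
bytes_per_param = 2    # bf16 weights

weight_gb = params * bytes_per_param / 1e9
train_flops = 6 * params * tokens   # the standard ~6*N*D estimate for dense training

print(f"weights alone: ~{weight_gb:.0f} GB in bf16")
print(f"training compute: ~{train_flops:.1e} FLOPs")
```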
