David Ifeoluwa Adelani 🇳🇬 (@davlanade) 's Twitter Profile
David Ifeoluwa Adelani 🇳🇬

@davlanade

Assistant Professor @mcgillu, Core Academic Member @Mila_Quebec, Canada CIFAR AI Chair @CIFAR_News | interested in multilingual NLP | Disciple of Jesus

ID: 908344218954944512

Link: https://dadelani.github.io/ | Joined: 14-09-2017 14:58:32

3.3K Tweets

2.2K Followers

1.1K Following

Irene Li (@irenelizihui) 's Twitter Profile Photo

📢 Today, we release #MMLUProX, which upgrades MMLU-Pro to 29 languages across 14 disciplines—11,829 reasoning-heavy Qs per language (≈342 k total). The toughest multilingual stress test for today’s LLMs! 🌐🧠 Heartfelt thanks to everyone who contributed.🤝

Fleming Initiative (@flemingcentre) 's Twitter Profile Photo

We are proud to invite applications for a new Google DeepMind Academic Fellowship, hosted by the Fleming Initiative at Imperial College London, that supports groundbreaking research at the intersection of AMR and AI ⤵️ imperial.ac.uk/news/264673/fl…

Wenhu Chen (@wenhuchen) 's Twitter Profile Photo

🚨 New Paper Alert 🚨

We found that Supervised Fine-tuning on ONE problem can achieve similar performance gain as RL on ONE problem with 20x less compute! 
Paper: arxiv.org/abs/2506.03295

Recently, people have shown that RL can work even with ONE example. This indicates that the
Abraham Owodunni (@abrahamowos) 's Twitter Profile Photo

After over 2 years of work, I'm glad to be finally wrapping up NaijaVoices with our accepted paper at #Interspeech 25! We created 1800+ hours of multilingual speech data in Hausa, Igbo, & Yoruba across 100 themes with a community we built entirely from scratch (see paper)!

ACLRollingReview (@reviewacl) 's Twitter Profile Photo

🚨 New for May 2025: Highly irresponsible reviewers/ACs may become ineligible to commit papers to EMNLP/ARR next cycle. ❗️Reviewers must follow guidelines & deadlines. ❗️Chairs must be notified in case of emergency. ❗️Do not share papers with third parties such as commercial LLM services.

Aishwarya Agrawal (@aagrawalaa) 's Twitter Profile Photo

If you want to learn more about how culturally inclusive current vision-language models are and what the outstanding research questions in this area are, do stop by our VLMs4All workshop at CVPR 2025 on June 12th -- sites.google.com/corp/view/vlms…

EleutherAI (@aieleuther) 's Twitter Profile Photo

Can you train a performant language model without using unlicensed text?

We are thrilled to announce the Common Pile v0.1, an 8TB dataset of openly licensed and public domain text. We train 7B models for 1T and 2T tokens and match the performance of similar models like LLaMA 1 & 2.
Ziling Cheng (@ziling_cheng) 's Twitter Profile Photo

Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts follow a systematic failure mode — revealing how LLMs generalize using abstract classes + context cues, albeit unreliably.

📎 Paper: arxiv.org/abs/2505.22630 1/n
Sarvam AI (@sarvamai) 's Twitter Profile Photo

Today we’re announcing Sarvam-Translate, an open-weights model that translates text across 22 Indian languages, with support for long-form text and the ability to handle diverse formats, contexts, and styles.

Sarvam-Translate stands out for its ability to handle the complexities
Niyati Bafna (@bafnaniyati) 's Twitter Profile Photo

We know speech LID systems flunk on accented speech. But why? And what to do about it?🤔Our work arxiv.org/abs/2506.00628 (Interspeech '25) finds that *accent-language confusion* is an important culprit, ties it to the length of feature that a model relies on, and proposes a fix.

Zhijing Jin✈️ ICLR Singapore (@zhijingjin) 's Twitter Profile Photo

Really excited about our recent large collaboration work on NLP for Social Good. The work stems from our discussions at the NLP for Positive Impact Workshop at #EMNLP2024. Thanks to all our awesome collaborators, workshop attendees and all supporters!

Graham Neubig (@gneubig) 's Twitter Profile Photo

Where does one language model outperform the other? We examine this from first principles, performing unsupervised discovery of "abilities" that one model has and the other does not. Results show interesting differences between model classes, sizes and pre-/post-training.

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Truly excellent video by Machine Learning Street Talk about how a handful of providers have systematically overfit to lmarena.ai. The 26-minute video showcases how easy it has been to distort the rankings. As scientists, we must do better. As a community, I hope we can demand better.

Tech At Bloomberg (@techatbloomberg) 's Twitter Profile Photo

Our CTO #DataScience Speaker Series welcomes McGill University's David Adelani (@davlanade) to our Engineering office in NYC to talk with our #AI research engineers about scaling multilingual evaluation of #LLMs to many languages bloom.bg/3ZGFIAf #GenAI #NLProc

Data Science for Social Impact (@dsfsi_research) 's Twitter Profile Photo

🚀 Help shape African language tech! Take a quick 10-15 min survey by DSFSI to build fair, community-driven machine translation & small language models for African languages. Your voice matters! 🌍🗣️ 👉 forms.gle/NrDg7kn3jqs3dt… cc Masakhane #AfricanLanguages

Muhammad AbdulMageed (@mageed) 's Twitter Profile Photo

🚩 💬 We're running the NADI 2025 shared task, focused on Multidialectal Arabic Speech Processing. Welcoming y'all! #NADI2025 #ArabicSpeech #ASR #ArabicNLP

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

How can we make language models more flexible to adapt to new languages after pretraining? 🌏 🧠 Our latest work investigates whether a tokenizer trained on more languages than the pretraining target can improve language plasticity without compromising pretraining performance.

Sohee Yang (@soheeyang_) 's Twitter Profile Photo

🚨 New Paper 🧵
How effectively do reasoning models reevaluate their thought? We find that:
- Models excel at identifying unhelpful thoughts but struggle to recover from them
- Smaller models can be more robust
- Self-reevaluation ability is far from true meta-cognitive awareness
Josh Meyer (@_josh_meyer_) 's Twitter Profile Photo

The NaijaVoices Dataset (accepted to Interspeech 2025) arXiv link: arxiv.org/abs/2505.20564 video overview: supabase.manatee.work/storage/v1/obj…