Sebastian Ruder (@seb_ruder) 's Twitter Profile
Sebastian Ruder

@seb_ruder

Research Scientist @AIatMeta • Ex @Cohere @GoogleDeepMind

ID: 2785337469

linkhttp://www.ruder.io calendar_today26-09-2014 12:47:34

4,4K Tweet

88,88K Followers

1,1K Following

ChipAgents.ai (@alphadesignai) 's Twitter Profile Photo

🚀Introducing ChipAgents: the World's First AI Agent for Chip Design and Verification. Get ready to supercharge your workflow and accelerate your time-to-market! 💻⚡

Sebastian Ruder (@seb_ruder) 's Twitter Profile Photo

If you're looking for an MSc or PhD position, I highly recommend applying to David's lab! David is an amazing, humble, and kind researcher and Mila is a great research environment.

Monojit Choudhury (@monojitchou) 's Twitter Profile Photo

Some exciting news from SUMEval-2. Consider submitting your work on multilingual and multicultural NLP. We are accepting both unpublished (for archival) and previously published (for non-archival) work. #NLProc

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

Introducing ✨Aya Expanse ✨ – an open-weights state-of-art family of models to help close the language gap with AI. Aya Expanse is both global and local. Driven by a multi-year commitment to multilingual research. cohere.com/research/aya

Srishti Gureja (@srishti_gureja) 's Twitter Profile Photo

✨ New Evaluation Benchmark for Reward Models - We Go Multilingual! ✨ Introducing M-RewardBench: A massively multilingual RM evaluation benchmark covering 23 typologically different languages across 5 tasks. Paper, code, dataset: m-rewardbench.github.io Our contributions: 1/9

✨ New Evaluation Benchmark for Reward Models - We Go Multilingual! ✨

Introducing M-RewardBench: A massively multilingual RM evaluation benchmark covering 23 typologically different languages across 5 tasks.
Paper, code, dataset: m-rewardbench.github.io

Our contributions:
1/9
Sebastian Ruder (@seb_ruder) 's Twitter Profile Photo

Reward models are crucial for aligning models to human preferences but so far their evaluation has been limited to English. I was fortunate to be involved with this Cohere For AI project, which introduces a new multilingual RM benchmark and many insightful analyses.

Yanai Elazar (@yanaiela) 's Twitter Profile Photo

On that note, someone organizing a workshop at ACL 2025 (ACL 2025) wants to switch with our NAACL 2025 slot? (I guess it's a thing now)

Roberta Raileanu (@robertarail) 's Twitter Profile Photo

Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the

Piotr Nawrot (@p_nawrot) 's Twitter Profile Photo

Sparse attention is one of the most promising strategies to unlock long-context processing and long generation reasoning in LLMs. We performed the most comprehensive study on training-free sparse attention to date. Here is what we found:

Sparse attention is one of the most promising strategies to unlock long-context processing and long generation reasoning in LLMs.

We performed the most comprehensive study on training-free sparse attention to date.

Here is what we found:
Piotr Nawrot (@p_nawrot) 's Twitter Profile Photo

We built sparse-frontier — a clean abstraction that lets you focus on your custom sparse attention implementation while automatically inheriting vLLM’s optimizations and model support. As a PhD student, I've learned that sometimes the bottleneck in research isn't ideas — it's

We built sparse-frontier — a clean abstraction that lets you focus on your custom sparse attention implementation while automatically inheriting vLLM’s optimizations and model support.

As a PhD student, I've learned that sometimes the bottleneck in research isn't ideas — it's
Hugo Bowne-Anderson (@hugobowne) 's Twitter Profile Photo

I had lunch with Sebastian Ruder in Berlin a few days ago. Had delicious food and a wonderful, generative conversation about how people are building with AI today, the future of LLMs++, the ins and outs of post-training, what the future of an agentic world could look like, and much

I had lunch with <a href="/seb_ruder/">Sebastian Ruder</a> in Berlin a few days ago. Had delicious food and a wonderful, generative conversation about how people are building with AI today, the future of LLMs++, the ins and outs of post-training, what the future of an agentic world could look like, and much
Kelly Marchisio (St. Denis) (@cheeesio) 's Twitter Profile Photo

The Multilingual Team at cohere is hiring! If this sounds like you, please apply: - strong coding skills and a keen eye for detail - experience working with the challenges & joys of multilingual data Help us bring AI to the world! 🌏🌍🌎 jobs.ashbyhq.com/cohere/a87be94…