Oreva Ahia (@orevaahia) 's Twitter Profile
Oreva Ahia

@orevaahia

PhD student @uwcse | ex: AI/ML Research Intern @apple | Co-organizer @AISaturdayLagos | Researcher @MasakhaneNLP -- Tomorrow may never come!

ID: 836314434

Link: https://orevaahia.github.io/ | Joined: 20-09-2012 20:39:09

1.1K Tweets

1.1K Followers

1.1K Following

Valentin Hofmann (@vjhofmann) 's Twitter Profile Photo

Humans store thousands of multi-word expressions like "of course" in their mental lexicon, but current tokenizers don't support multi-word tokens. Enter SuperBPE, a tokenizer that lifts this restriction and brings substantial gains in efficiency and performance! 🚀 Details 👇
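
To make the idea concrete, here is a minimal toy sketch of the two-stage scheme SuperBPE describes: standard BPE merges never cross whitespace, and a second training stage lifts that restriction so multi-word "superword" tokens can form. The corpus, merge budget, and staging below are illustrative assumptions, not the authors' implementation.

```python
# Toy two-stage BPE in the spirit of SuperBPE (illustrative, not the
# authors' code). Stage 1 learns ordinary subwords; stage 2 allows
# merges across spaces, so "of course" can become a single token.
from collections import Counter

def most_frequent_pair(corpus, allow_cross_word):
    counts = Counter()
    for seq in corpus:
        for a, b in zip(seq, seq[1:]):
            # Stage 1: skip any merge whose result would contain a space.
            if not allow_cross_word and " " in a + b:
                continue
            counts[(a, b)] += 1
    return counts.most_common(1)[0][0] if counts else None

def apply_merge(corpus, pair):
    merged = []
    for seq in corpus:
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(seq[i] + seq[i + 1])
                i += 2
            else:
                out.append(seq[i])
                i += 1
        merged.append(out)
    return merged

# Tiny corpus, pre-split into characters (spaces are tokens too).
corpus = [list("of course it works"), list("of course it does")]
allow_cross_word = False
for _ in range(30):  # assumed merge budget
    pair = most_frequent_pair(corpus, allow_cross_word)
    if pair is None:
        if allow_cross_word:
            break
        allow_cross_word = True  # stage 2: lift the whitespace restriction
        continue
    corpus = apply_merge(corpus, pair)
print(corpus[0])  # after stage 2, "of course" can surface as one token
```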

Gonรงalo Faria (@goncalorafaria) 's Twitter Profile Photo

Introducing ๐—ค๐—”๐—น๐—ถ๐—ด๐—ป๐Ÿš€, a ๐˜๐—ฒ๐˜€๐˜-๐˜๐—ถ๐—บ๐—ฒ ๐—ฎ๐—น๐—ถ๐—ด๐—ป๐—บ๐—ฒ๐—ป๐˜ ๐—บ๐—ฒ๐˜๐—ต๐—ผ๐—ฑ that improves language model performance using Markov chain Monte Carlo. With no model retraining, ๐—ค๐—”๐—น๐—ถ๐—ด๐—ป outperforms DPO-tuned models even when allowed to match inference compute, and achieves

Introducing ๐—ค๐—”๐—น๐—ถ๐—ด๐—ป๐Ÿš€, a ๐˜๐—ฒ๐˜€๐˜-๐˜๐—ถ๐—บ๐—ฒ ๐—ฎ๐—น๐—ถ๐—ด๐—ป๐—บ๐—ฒ๐—ป๐˜ ๐—บ๐—ฒ๐˜๐—ต๐—ผ๐—ฑ that improves language model performance using Markov chain Monte Carlo. 
With no model retraining, ๐—ค๐—”๐—น๐—ถ๐—ด๐—ป outperforms DPO-tuned models even when allowed to match inference compute, and achieves
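
For intuition, a heavily simplified sketch of what test-time alignment with MCMC can look like: keep a chain over full completions, propose by truncating and regenerating a suffix with the base model, and accept or reject using a reward model. The helper names are assumptions, and the real method uses a properly corrected acceptance rule rather than this bare Metropolis step.

```python
# Simplified MCMC-over-completions sketch (illustrative; QAlign's actual
# acceptance rule also corrects for proposal likelihoods).
import math, random

def mcmc_align(prompt, generate_suffix, reward, steps=100, beta=1.0):
    # generate_suffix(prompt, prefix) -> new suffix text from the base LM
    # reward(prompt, completion)      -> scalar score from a reward model
    current = generate_suffix(prompt, "")
    r_cur = reward(prompt, current)
    for _ in range(steps):
        cut = random.randrange(len(current) + 1)   # random truncation point
        proposal = current[:cut] + generate_suffix(prompt, current[:cut])
        r_prop = reward(prompt, proposal)
        # Metropolis step: always accept improvements, sometimes accept
        # worse proposals so the chain keeps exploring.
        if math.log(random.random()) < beta * (r_prop - r_cur):
            current, r_cur = proposal, r_prop
    return current
```
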
Sachin Kumar (@shocheen) 's Twitter Profile Photo

Really excited for this paper to be out. This project began nearly a year ago when I was at Ai2. Activation steering and related ideas were incredibly appealing, and we explored applying them to a range of problems. But none of the techniques we tried led to meaningful…
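
For readers unfamiliar with the technique named above: activation steering, in its simplest form, adds a fixed vector to a model's hidden states at inference time. A generic sketch follows; the model, layer choice, and random vector are placeholders, since steering vectors are normally derived from contrastive activations.

```python
# Generic activation-steering sketch (not the paper's method). A steering
# vector is added to one transformer layer's output via a forward hook.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")            # placeholder model
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Placeholder vector; in practice this comes from contrasting activations
# on examples with vs. without the target behavior.
steer = 0.1 * torch.randn(model.config.hidden_size)

def hook(module, inputs, output):
    # GPT-2 blocks return a tuple whose first element is the hidden states.
    return (output[0] + steer,) + output[1:]

handle = model.transformer.h[6].register_forward_hook(hook)  # layer is arbitrary
ids = tok("The weather today is", return_tensors="pt")
out = model.generate(**ids, max_new_tokens=20)
print(tok.decode(out[0], skip_special_tokens=True))
handle.remove()
```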

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

We're starting a new chapter as Cohere Labs! 🎉 After 3 years of innovation, our new name reflects our continued dedication to research, collaboration, and open science. Our mission remains: transforming spaces where breakthroughs happen. Here's to the next chapter.

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Very proud of what we have achieved over the last 3 years, and the breakthroughs ahead. 🔥 Our new name helps better communicate our work and our impact at the frontier of AI progress. Everything else stays the same, including our commitment to explore the unknown, together.

Ian Magnusson (@ianmagnusson) 's Twitter Profile Photo

🔭 Science relies on shared artifacts collected for the common good. 🛰 So we asked: what's missing in open language modeling? 🪐 DataDecide 🌌 charts the cosmos of pretraining, across scales and corpora, at a resolution beyond any public suite of models that has come before.

Julie Kallini ✨ @ ICLR 2025 ✈️ (@juliekallini) 's Twitter Profile Photo

🚀 In T-minus 1 week, I'll be at ICLR presenting MrT5!

The final version has tons of updates:
- New controller algorithm for targeted compression rates
- More baselines and downstream tasks
- Scaled-up experiments to 1.23B parameter models

And now, MrT5 is on 🤗HuggingFace! 🧵
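
Since the tweet says MrT5 is now on Hugging Face, loading it should follow the usual transformers pattern. The repo id below is a placeholder assumption (check the authors' release for the real one), and custom model code may need trust_remote_code=True.

```python
# Hedged usage sketch for a Hub-hosted MrT5 checkpoint. The repo id is an
# assumption, not the real identifier.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

repo = "example-org/mrt5-base"  # placeholder repo id
tok = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForSeq2SeqLM.from_pretrained(repo, trust_remote_code=True)

# MrT5 builds on ByT5, so inputs are UTF-8 bytes rather than subwords.
ids = tok("A byte-level encoder that learns to delete tokens.", return_tensors="pt")
out = model.generate(**ids, max_new_tokens=40)
print(tok.decode(out[0], skip_special_tokens=True))
```
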
Tokenization Workshop (TokShop) @ICML2025 (@tokshop2025) 's Twitter Profile Photo

🚨 NEW WORKSHOP ALERT 🚨 We're thrilled to announce the first-ever Tokenization Workshop (TokShop) at #ICML2025! 🎉 Submissions are open for work on tokenization across all areas of machine learning.
📅 Submission deadline: May 30, 2025
🔗 tokenization-workshop.github.io

Oreva Ahia (@orevaahia) 's Twitter Profile Photo

Working on tokenization across any modality (text, audio, images, videos)? Submit your paper to our Tokenization Workshop at #ICML2025!

Valentin Hofmann (@vjhofmann) 's Twitter Profile Photo

Delighted there will finally be a workshop devoted to tokenization, a critical topic for LLMs and beyond! 🎉 Join us for the inaugural edition of TokShop at #ICML2025 in Vancouver this summer! 🤗

Kabir (@kabirahuja004) 's Twitter Profile Photo

📢 New Paper!

Tired 😴 of reasoning benchmarks full of math & code? In our work, we consider the problem of reasoning about plot holes in stories -- inconsistencies in a storyline that break the internal logic or rules of a story's world 🌎

W/ Melanie Sclar and tsvetshop

1/n
Chan Young Park (@chan_young_park) 's Twitter Profile Photo

While I'm on X to share my paper, I also have a life update: I'll be joining the School of Information at UT Austin as an assistant professor starting Fall 2026! Excited for this next chapter, and to keep working on teaching computers to better understand language and humans (and now teaching humans too).

Stella Li (@stellalisy) 's Twitter Profile Photo

🤯 We cracked RLVR with... Random Rewards?!

Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: +28.8%

How could this even work⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…
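
To see how stark the finding is, here is what the three reward schemes from the thread might look like as plain reward functions in an RLVR loop. This is an illustrative sketch, not the authors' training code, and the answer extractor is a hypothetical placeholder.

```python
# Three RLVR reward schemes from the thread, as plain functions
# (illustrative; not the authors' training code).
import random

def extract_answer(completion: str) -> str:
    # Hypothetical placeholder: a real extractor parses e.g. \boxed{...}.
    return completion.strip().splitlines()[-1]

def random_reward(completion, answer):
    return float(random.random() < 0.5)   # coin flip; ignores correctness

def incorrect_reward(completion, answer):
    return float(extract_answer(completion) != answer)  # rewards wrong answers

def ground_truth_reward(completion, answer):
    return float(extract_answer(completion) == answer)  # standard verifiable reward
```

The surprise in the thread is that even the first two schemes, which carry no correct training signal at all, still improve MATH-500 on Qwen2.5-Math-7B.
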
Roy Xie (@royxie_) 's Twitter Profile Photo

Can we train reasoning LLMs to generate answers as they think?

Introducing Interleaved Reasoning! We train LLMs to alternate between thinking & answering 🚀
Reducing Time-to-First-Token (TTFT) by over 80% ⚡ AND improving Pass@1 accuracy by up to 19.3%! 📈

🧵 1/n
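
A small sketch of why interleaving cuts time-to-first-token: if the model alternates thinking spans with answer spans, answer tokens can be streamed to the user as each span closes instead of waiting for the full chain of thought. The tags below are assumed markup, not necessarily the paper's exact format.

```python
# Parse an interleaved trace and stream answer spans as they complete
# (tags are assumed; illustrative only).
import re

trace = (
    "<think>The train covers 60 km in 1.5 h.</think>"
    "<answer>Its speed is 40 km/h.</answer>"
    "<think>At 40 km/h, 100 km takes 100/40 h.</think>"
    "<answer>So the full trip takes 2.5 hours.</answer>"
)

for kind, text in re.findall(r"<(think|answer)>(.*?)</\1>", trace):
    if kind == "answer":
        print(text)  # first answer arrives before reasoning finishes
```
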
Tokenization Workshop (TokShop) @ICML2025 (@tokshop2025) 's Twitter Profile Photo

Got a good tokenization paper under review at COLM, but the scores were a letdown? 😬 Why bother with a rebuttal when the perfect venue is right around the corner? Submit your paper to the #ICML2025 Tokenization Workshop (TokShop) by May 30! 🚀

Yizhong Wang (@yizhongwyz) 's Twitter Profile Photo

Thrilled to announce that I will be joining UT Austin Computer Science as an assistant professor in fall 2026!

I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! 🤠 🤘
Oreva Ahia (@orevaahia) 's Twitter Profile Photo

🚨 Reminder: Paper submissions for the 1st Tokenization Workshop (TokShop) at #ICML2025 are due today, May 30!
🔗 CFP: tokenization-workshop.github.io

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Truly excellent video by Machine Learning Street Talk about how a handful of providers have systematically overfit to lmarena.ai.

The 26-minute video shows how easy it has been to distort the rankings.

As scientists, we must do better. As a community, I hope we can demand better.