
Oreva Ahia
@orevaahia
PhD student @uwcse | ex: AI/ML Research Intern @apple | Co-organizer @AISaturdayLagos | Researcher @MasakhaneNLP --Tomorrow may never come !
ID: 836314434
https://orevaahia.github.io/ 20-09-2012 20:39:09
1,1K Tweet
1,1K Followers
1,1K Following


Introducing ๐ค๐๐น๐ถ๐ด๐ป๐, a ๐๐ฒ๐๐-๐๐ถ๐บ๐ฒ ๐ฎ๐น๐ถ๐ด๐ป๐บ๐ฒ๐ป๐ ๐บ๐ฒ๐๐ต๐ผ๐ฑ that improves language model performance using Markov chain Monte Carlo. With no model retraining, ๐ค๐๐น๐ถ๐ด๐ป outperforms DPO-tuned models even when allowed to match inference compute, and achieves





๐ญ Science relies on shared artifacts collected for the common good. ๐ฐ So we asked: what's missing in open language modeling? ๐ช DataDecide ๐ charts the cosmos of pretrainingโacross scales and corporaโat a resolution beyond any public suite of models that has come before.

๐จ NEW WORKSHOP ALERT ๐จ We're thrilled to announce the first-ever Tokenization Workshop (TokShop) at #ICML2025 ICML Conference! ๐ Submissions are open for work on tokenization across all areas of machine learning. ๐ Submission deadline: May 30, 2025 ๐ tokenization-workshop.github.io




Delighted there will finally be a workshop devoted to tokenization - a critical topic for LLMs and beyond! ๐ Join us for the inaugural edition of TokShop at #ICML2025 ICML Conference in Vancouver this summer! ๐ค


While I'm on X to share my paper, I also have a life update I'll be joining School of Information - UT Austin as an assistant professor starting Fall 2026! Excited for this next chapter, and to keep working on teaching computers to better understand language and humans (+now teaching humans too)

Can we train reasoning LLMs to generate answers as they think? Introducing ๐๐ง๐ญ๐๐ซ๐ฅ๐๐๐ฏ๐๐ ๐๐๐๐ฌ๐จ๐ง๐ข๐ง๐ ! We train LLMs to alternate between thinking & answering ๐ Reducing Time-to-First-Token (TTFT) by over 80% โกAND improving Pass@1 accuracy up to 19.3%!๐ ๐งต 1/n



Thrilled to announce that I will be joining UT Austin Computer Science at UT Austin as an assistant professor in fall 2026! I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! ๐ค ๐ค



Truly excellent video by Machine Learning Street Talk about how a handful of providers have systematically overfit to lmarena.ai. 26 mins of video showcase how easy it has been to distort the rankings. As scientists, we must do better. As a community, I hope we can demand better.
