Melanie Sclar (@melaniesclar)'s Twitter Profile
Melanie Sclar

@melaniesclar

PhD student @uwnlp @uwcse | Visiting Researcher @AIatMeta FAIR | Prev. Lead ML Engineer @asapp, intern @LTIatCMU | 🇦🇷

ID: 240904052

Link: http://msclar.github.io | Joined: 21-01-2011 00:46:44

455 Tweets

1.1K Followers

471 Following

Ximing Lu (@gximing)

With the rise of R1, search seems out of fashion? We prove the opposite! 😎 Introducing Retro-Search 🌈: an MCTS-inspired search algorithm that RETROspectively revises R1’s reasoning traces to synthesize untaken, new reasoning paths that are better 💡, yet shorter in length ⚡️.

Kabir (@kabirahuja004)

Still around at #NAACL2025 ? I will be presenting a poster for the work 👇at the Workshop on Narrative Understanding in Tesuque, Albuquerque Convention Center from 2:30 pm. Please stop by if interested. Here is the poster, designed by the amazing Advait Bhat | अद्वैत.

ComputerUseAgents Workshop (@workshopcua)

We begin our speaker spotlights with Alane Suhr (@alsuhr), Assistant Professor at UC Berkeley and an invited speaker at the Workshop on Computer Use Agents at ICML Conference 2025! Her research centers on building systems that use language to interact with people, enabling agents to

Wenting Zhao (@wzhao_nlp)

Excited to announce our workshop on Visions of Language Modeling at COLM'25! 🔥 We thought that current LM research overly focuses on a narrow set of popular topics (e.g., test-time scaling and LLM agents), and we'd love to bring some entropy back 💪 To do this, we invited a

Hyunwoo Kim (@hyunw_kim)

📢I'm thrilled to announce that I’ll be joining @KAIST_AI as an Assistant Professor in 2026, leading the Computation & Cognition (COCO) Lab🤖🧠: coco-kaist.github.io We'll be exploring reasoning, learning w/ synthetic data, and social agents! +I'm spending a gap year @nvidia✨

Stella Li (@stellalisy)

🤯 We cracked RLVR with... Random Rewards?!
Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: +28.8%
How could this even work⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…

Yizhong Wang (@yizhongwyz)

Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! 🤠🤘

Thao Nguyen (@thao_nguyen26)

Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔 We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats! arxiv.org/abs/2506.04689

Weijia Shi (@weijiashi2)

Can data owners & LM developers collaborate to build a strong shared model while each retaining data control? Introducing FlexOlmo💪, a mixture-of-experts LM enabling:
• Flexible training on your local data without sharing it
• Flexible inference to opt in/out your data

Oreva Ahia (@orevaahia)

🎉 We’re excited to introduce BLAB: Brutally Long Audio Bench, the first benchmark for evaluating long-form reasoning in audio LMs across 8 challenging tasks, using 833+ hours of Creative Commons audio. (avg length: 51 minutes).

Stella Li (@stellalisy)

WHY do you prefer something over another?
Reward models treat preference as a black-box😶‍🌫️but human brains🧠decompose decisions into hidden attributes
We built the first system to mirror how people really make decisions in our #COLM2025 paper🎨PrefPalette✨
Why it matters👉🏻🧵

Melanie Sclar (@melaniesclar)

Check out our work on preference modeling through latent (& interpretable) attribute representation learning! PrefPalette allows you to understand _why_ something is preferred and _how_ preference varies depending on context 🎨

LAW Workshop@NeurIPS 2025 (@law2025_neurips)

📢 Thrilled to announce LAW 2025 workshop, Bridging Language, Agent, and World Models, at #NeurIPS2025 this December in San Diego! 🌴🏖️ 🎉 Join us in exploring the exciting intersection of #LLMs, #Agents, #WorldModels! 🧠🤖🌍 🔗 sites.google.com/view/law-2025 #ML #AI #GenerativeAI 1/
