Kawin Ethayarajh (@ethayarajh) 's Twitter Profile
Kawin Ethayarajh

@ethayarajh

Postdoc at @PrincetonPLI. PhD @StanfordAILab. Behavioral machine learning. πŸ‡¨πŸ‡¦

https://kawine.github.io/ · Joined 28-03-2019 21:45:10

1.1K Tweets

3.3K Followers

867 Following

kalomaze (@kalomaze) 's Twitter Profile Photo

i dunk on DPO a lot but i find it funny how KTO solves basically the same problem given the same constraints (offline RL optimization on chosen/rejected) and yet it gets like none of the adoption and glory
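
For readers unfamiliar with the contrast, here is a minimal sketch of the two objectives, assuming summed token log-probs as inputs and a precomputed batch-level KL estimate for KTO's reference point (a simplification of the papers, not their training code):

```python
import torch
import torch.nn.functional as F

def dpo_loss(logp_chosen, logp_rejected, ref_chosen, ref_rejected, beta=0.1):
    # DPO requires *paired* data: a chosen and a rejected completion
    # for the same prompt. All inputs are summed token log-probs.
    margin = (logp_chosen - ref_chosen) - (logp_rejected - ref_rejected)
    return -F.logsigmoid(beta * margin).mean()

def kto_loss(logp, ref_logp, desirable, kl_ref, beta=0.1,
             lambda_d=1.0, lambda_u=1.0):
    # KTO scores each completion *alone* against a reference point
    # (a batch-level KL estimate), so chosen/rejected need not be paired.
    reward = logp - ref_logp
    loss_d = lambda_d * (1 - torch.sigmoid(beta * (reward - kl_ref)))
    loss_u = lambda_u * (1 - torch.sigmoid(beta * (kl_ref - reward)))
    return torch.where(desirable, loss_d, loss_u).mean()
```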

Arjun Narayan (@narayanarjun) 's Twitter Profile Photo

just as the holy roman empire was neither holy, nor roman, nor an empire, ARR today is neither annual, nor recurring, and perhaps not even revenue

1a3orn (@1a3orn) 's Twitter Profile Photo

I suspect this paper's results have been oversold somewhat. As far as I can tell, nothing in the paper excludes the possibility that a quite large % of the "learning" here is just "learns to put answers in \\boxed{...}" tags.

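To make the worry concrete, here is an illustrative (hypothetical, but typical of rule-based math evaluation) grader: if the benchmark only credits answers inside \boxed{...}, a model can raise its score purely by learning the format.

```python
import re

def extract_boxed(completion: str):
    # Rule-based math graders commonly parse the final answer out of
    # \boxed{...}; a correct answer outside the tag scores zero.
    match = re.search(r"\\boxed\{([^{}]*)\}", completion)
    return match.group(1).strip() if match else None

# Same underlying ability, very different scores:
assert extract_boxed("The answer is 42.") is None           # graded wrong
assert extract_boxed(r"The answer is \boxed{42}.") == "42"  # graded right
```
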
Percy Liang (@percyliang) 's Twitter Profile Photo

What would truly open-source AI look like? Not just open weights, open code/data, but *open development*, where the entire research and development process is public *and* anyone can contribute. We built Marin, an open lab, to fulfill this vision:

Zack Witten (@zswitten) 's Twitter Profile Photo

Today I’d like to tell the tale of how an innocent member of Anthropic technical staff summoned from the void a fictional 9,000-pound hippo named Gustav, and the chaos this hippo wrought. 🧡

Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were severely underreported across papers. We compiled discrepancies in a blog belowπŸ§΅πŸ‘‡

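A sketch of why baselines drift (everything below is hypothetical, not the blog's code): a "pre-RL" score is a function of the whole evaluation harness, not just the checkpoint.

```python
def pass_at_1(model, problems, grader, temperature=0.0, template="{q}"):
    # Hypothetical eval loop: the reported number depends on every knob
    # here (decoding temperature, prompt template, grading rule), not
    # just on the weights being evaluated.
    correct = 0
    for question, answer in problems:
        completion = model.generate(template.format(q=question),
                                    temperature=temperature)
        correct += grader(completion, answer)
    return correct / len(problems)

# The same checkpoint can look far weaker under one harness than another
# (e.g. sampled decoding + strict parsing vs. greedy + lenient parsing),
# which inflates the apparent gain from RL.
```
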
Yizhong Wang (@yizhongwyz) 's Twitter Profile Photo

Thrilled to announce that I will be joining Computer Science at UT Austin as an assistant professor in fall 2026! I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! 🀠🀘

Omar Shaikh (@oshaikh13) 's Twitter Profile Photo

What if LLMs could learn your habits and preferences well enough (across any context!) to anticipate your needs? In a new paper, we present the General User Model (GUM): a model of you built from just your everyday computer use. 🧡

Kawin Ethayarajh (@ethayarajh) 's Twitter Profile Photo

Trading online compute for offline compute is an under-discussed axis of scaling, but one that will be increasingly relevant going forward.

Kawin Ethayarajh (@ethayarajh) 's Twitter Profile Photo

Sidd is an incredible researcher and mentor who's done some pioneering work at the intersection of NLP + robotics. Go work with him!

Matthew Finlayson ✈️ NeurIPS (@mattf1n) 's Twitter Profile Photo

I didn't believe when I first saw, but:
We trained a prompt stealing model that gets >3x SoTA accuracy.
The secret is representing LLM outputs *correctly*

🚲 Demo/blog: mattf1n.github.io/pils
πŸ“„: arxiv.org/abs/2506.17090
πŸ€–: huggingface.co/dill-lab/pils-…
πŸ§‘β€πŸ’»: github.com/dill-lab/PILS
Sanjana Srivastava (@sanjana__z) 's Twitter Profile Photo

πŸ€– Household robots are becoming physically viable. But interacting with people in the home requires handling unseen, unconstrained, dynamic preferences, not just a complex physical domain. We introduce ROSETTA: a method to generate rewards for such preferences cheaply. πŸ§΅β¬‡οΈ

Sabri Eyuboglu (@eyuboglusabri) 's Twitter Profile Photo

I'll be at #ICML in Vancouver next week -- looking forward to meeting new folks. Shoot me an email if you'll be there and want to chat!! These days, I'm particularly interested in LLM memory, personalization, and lifelong learning -- but excited to learn about anything!

kalomaze (@kalomaze) 's Twitter Profile Photo

"arbitrary pairing is good actually" is something the KTO paper also claimed before in the past but i feel stupid for not, like, internalizing that

Jessy Lin (@realjessylin) 's Twitter Profile Photo

User simulators bridge RL with real-world interaction //

jessylin.com/2025/07/10/use…

How do we get the RL paradigm to work on tasks beyond math & code? Instead of designing datasets, RL requires designing environments. Given that most non-trivial real-world tasks involve
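
A sketch of the idea (class and method names are hypothetical): wrap a simulated user as an RL environment, so the "dataset" becomes whatever conversations the policy elicits.

```python
class UserSimulatorEnv:
    """Gym-style environment where the other agent is a simulated user."""

    def __init__(self, user_model, task):
        self.user = user_model   # e.g., an LLM prompted to act as a user
        self.task = task         # defines the goal and the final reward

    def reset(self):
        self.history = [self.task.opening_message()]
        return self.history

    def step(self, agent_message):
        self.history.append(agent_message)
        user_reply, done = self.user.respond(self.history)
        self.history.append(user_reply)
        reward = self.task.score(self.history) if done else 0.0
        return self.history, reward, done
```
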
Keyon Vafa (@keyonv) 's Twitter Profile Photo

Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions. One result tells the story: a transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws 🧡
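
A sketch of what "botching the law" can mean operationally (the probe below is illustrative, not the paper's method; `predictor` is a hypothetical one-step trajectory model): finite-difference the model's predictions to recover its implied acceleration, then compare against the inverse-square law.

```python
import numpy as np

def implied_acceleration(predictor, position, velocity, dt=1e-3):
    # Recover the force law the model *implicitly* uses by finite-
    # differencing its one-step trajectory predictions.
    next_position, next_velocity = predictor(position, velocity, dt)
    return (next_velocity - velocity) / dt

def newtonian_acceleration(position, GM=1.0):
    # Ground truth for a test particle around a central mass:
    # a = -GM * r / |r|^3 (the inverse-square law).
    r = np.linalg.norm(position)
    return -GM * position / r**3

# A model can match Newtonian trajectories on seen orbits while its
# implied acceleration diverges from the inverse-square form elsewhere.
```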

Charles πŸŽ‰ Frye (@charles_irl) 's Twitter Profile Photo

In a new blog post for Modal, I argue against the prevailing denomination of LLM services in terms of dollars per token. Unless you're running inference-as-a-service, requests are the key unit of analysis -- as they are for databases, web servers, storage, etc.

In a new blog post for <a href="/modal_labs/">Modal</a>, I argue against the prevailing denomination of LLM services in terms of dollars per token.

Unless you're running inference-as-a-service, requests are the key unit of analysis -- as they are for databases, web servers, storage, etc.
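The arithmetic is worth making explicit (the prices below are placeholders, not any provider's rates): per-token prices only become decision-relevant once multiplied through a request profile.

```python
# Placeholder rates, chosen only for illustration.
PRICE_IN = 3.00 / 1_000_000    # $ per input token
PRICE_OUT = 15.00 / 1_000_000  # $ per output token

def request_cost(input_tokens: int, output_tokens: int) -> float:
    # The unit that matters for capacity planning and per-user economics.
    return input_tokens * PRICE_IN + output_tokens * PRICE_OUT

# A typical chat request with 2,000 tokens in and 500 out:
print(f"${request_cost(2_000, 500):.4f} per request")  # -> $0.0135
```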