
Kanishk Gandhi
@gandhikanishk
PhD CS @Stanford @StanfordNLP, Computation and Cognition; w/ Noah Goodman | Prev: @LakeBrenden @NYUDataScience, @IITKanpur, @Path_AI
ID: 489550720
11-02-2012 17:00:25
401 Tweets
1.1K Followers
871 Following


🚨 Your RL only improves pass@1, not pass@k? 🚨 That's not a bug, it's a feature of the objective you're optimizing. You get what you optimize for. If you want better pass@k, you need to optimize for pass@k at training time. 🧵 How?
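For context, pass@k is the probability that at least one of k sampled completions is correct. A minimal sketch of the standard unbiased estimator (given n samples with c correct), plus a hypothetical pass@k-style group reward of the kind the thread suggests optimizing; the function names here are illustrative, not from the thread:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator: probability that at least one of k
    completions drawn (without replacement) from n samples is correct,
    given that c of the n samples are correct."""
    if n - c < k:
        # Fewer than k incorrect samples exist, so any draw of k
        # must contain a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

def group_pass_at_k_reward(correct_flags: list[bool]) -> float:
    """Hypothetical pass@k-style group reward (an assumption, not the
    thread's exact method): a group of k sampled completions earns 1.0
    if ANY sample is correct, 0.0 otherwise. Optimizing this rewards
    diverse groups rather than a single high-probability answer."""
    return 1.0 if any(correct_flags) else 0.0
```

With a pass@1-style per-sample reward, every sample is pushed toward the single most likely correct answer; the group-level reward above only cares that one member of the group succeeds, which is one way the training objective can be aligned with the pass@k metric.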

I'm late to review the "Illusion of Thinking" paper, so let me collect some of the best threads and critical takes by Lisan al Gaib in one place and sprinkle in some of my own thoughts as well. The paper is rather critical of reasoning LLMs (LRMs): x.com/MFarajtabar/st…

