Ashwinee Panda (@pandaashwinee) Twitter Tweets • TwiCopy

Ashwinee Panda

@pandaashwinee

+ Follow

Postdoc of @tomgoldsteincs, PhD @princeton, @Cal alum, currently working on LLMs

ID: 1224111705950547971

linkhttps://kiddyboots216.github.io/ calendar_today02-02-2020 23:26:16

2,2K Tweet

2,2K Followers

763 Following

Ashwinee Panda

@pandaashwinee

7 months ago

i love it when the gemini AI overview is completely wrong on a boolean question where the google preview of the first result answers the question correctly

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

important read! folks who have SFTd models like llama2/llama3 on gsm8k will recognize that most of the gain you observe is in getting the model to produce the answer after the "####", and it seems like similar things are happening in RL-land.

thumb_up_off_alt56

chat_bubble_outline2

repeat5

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

eric infers from "i'm a couple hundred miles from japan tonight" and "we'd be in the same time zone" that mendes is in an area within a few hundred miles of japan that is in a different time zone; that's not many places, so eric can find him. my old response, & eric's counter:

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

regardless of how you feel about "the apple paper", it should be noted that the first author was an intern, and she has clearly thought about the responses that folks have. fwiw i'm solely RT'ing for visibility here, not because i agree with the paper or anything.

thumb_up_off_alt15

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

i learned this from Prateek Mittal during my phd. but tbh, i prefer to allow myself to get drawn into the weeds when doing the technical work, and rely on collaborators to ask the tough questions and keep me on track. what are the pros / cons of both?

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

the real reason lucas got poached is to chat about torch on slack

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

gemini diffusion

thumb_up_off_alt37

chat_bubble_outline1

repeat1

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

> progress is based on real-world experiments rather than raw intelligence the tech that people are cooking up now is based on the insights from deploying models. if GPT-5 can't actually deploy its creations, how is it going to figure out what is needed for GPT-6? evals?

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

6 months ago

"someone of Ilya Sutskever's capabilities" i've got just the guy...

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

5 months ago

we’ll be presenting LoRI at #COLM2025!

thumb_up_off_alt18

chat_bubble_outline0

repeat0

shareShare

Ashwinee Panda

@pandaashwinee

5 months ago

i really like this blog because reading it feels like having a conversation with albert; specifically, statements like "I’m driven by aesthetics much more than the average person, I’d guess" i'm excited to see this new architecture that i've been hearing so much about!

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare