Rajashree Agrawal (@___rajashree___) Twitter Tweets • TwiCopy

Rajashree Agrawal

4 years ago

Instantiation in educational program surveys: every program I've been to gets about a 9/10 rating to the question meant to measure counterfactual impact. OK OK even if it wasn't "meant" to measure, it will later be used to make this claim.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Rajashree Agrawal

@___rajashree___

4 years ago

"substantive" economics and mathematics discussions in academic settings are only bearable to me when i go in with the intention to make notes on how people will miscommunicate. i do this about once a week, also the frequency of my desire to start a new teaching program.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Rajashree Agrawal

@___rajashree___

4 years ago

it is interesting how much search results change when googling symptoms vs symptoms + "woman"

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Rajashree Agrawal

@___rajashree___

4 years ago

games i have experienced strong tetris effect for so far: 1. wordle (+variants): thinking in 5 letter words 2. set: 2 colorful images popping up, find the third one 3. kakuro: quiz me on how many ways there are to make 5 unique single digit integers add up to 28

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Jeffrey Ladish

@jeffladish

3 years ago

Love this little chart here from Open Phil

thumb_up_off_alt735

chat_bubble_outline17

repeat88

shareShare

Phil Galfond

@philgalfond

3 years ago

Are you afraid to value bet unless you’re almost positive your hand is good? It’s not uncommon, but you’re missing out on a massive amount of value and it’s costing you $$$!

thumb_up_off_alt131

chat_bubble_outline18

repeat2

shareShare

Rajashree Agrawal

@___rajashree___

2 years ago

Obvious but surprising fact from analysing my self-reported daily productivity metrics from 2023: using AI tools was anti-correlated with deep work. I would like to experiment with this more, but don't have any bright ideas for it.

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Rajashree Agrawal

@___rajashree___

2 years ago

There's no clearly winning score on OCEAN, but there is a losing score. Smh. Just like BMI.

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Rajashree Agrawal

@___rajashree___

2 years ago

MSJ is an order of magnitude more effective at jailbreaking Claude than SOTA attacks!!! The paper is a cool step in thinking about *model capabilities* as *attack surfaces*. Congrats to Cem Anil on knocking it out of the park! Super glad to have been a part of it!

thumb_up_off_alt12

chat_bubble_outline0

repeat2

shareShare

Jan Leike

@janleike

2 years ago

I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitoring, preparedness, safety, adversarial robustness, (super)alignment, confidentiality, societal impact, and related topics.

thumb_up_off_alt3,3K

chat_bubble_outline31

repeat254

shareShare

Jason Gross

@diagram_chaser

a year ago

Mechanistic interpretability gives us rich explanations of models. But can we convert these explanations into formal proofs? Surprisingly, yes! Mech interp helps write short proofs of generalization bounds — and, shorter proofs provide more mechanistic understanding. 🧵

thumb_up_off_alt177

chat_bubble_outline1

repeat34

shareShare

Y Combinator

@ycombinator

7 months ago

Theorem (Theorem) is an AI-coding IDE for mission-critical software. They're making program verification 10,000 times faster, so your systems code does exactly what you asked for. Congrats on the launch, Jason Gross and Rajashree Agrawal! ycombinator.com/launches/NZA-t…