Rajashree Agrawal (@___rajashree___) 's Twitter Profile
Rajashree Agrawal

@___rajashree___

building @theoremlabs

ID: 1357414777300148224

calendar_today04-02-2021 19:45:11

137 Tweet

223 Followers

402 Following

Rajashree Agrawal (@___rajashree___) 's Twitter Profile Photo

Instantiation in educational program surveys: every program I've been to gets about a 9/10 rating to the question meant to measure counterfactual impact. OK OK even if it wasn't "meant" to measure, it will later be used to make this claim.

Rajashree Agrawal (@___rajashree___) 's Twitter Profile Photo

"substantive" economics and mathematics discussions in academic settings are only bearable to me when i go in with the intention to make notes on how people will miscommunicate. i do this about once a week, also the frequency of my desire to start a new teaching program.

Rajashree Agrawal (@___rajashree___) 's Twitter Profile Photo

games i have experienced strong tetris effect for so far: 1. wordle (+variants): thinking in 5 letter words 2. set: 2 colorful images popping up, find the third one 3. kakuro: quiz me on how many ways there are to make 5 unique single digit integers add up to 28

Phil Galfond (@philgalfond) 's Twitter Profile Photo

Are you afraid to value bet unless you’re almost positive your hand is good? It’s not uncommon, but you’re missing out on a massive amount of value and it’s costing you $$$!

Rajashree Agrawal (@___rajashree___) 's Twitter Profile Photo

Obvious but surprising fact from analysing my self-reported daily productivity metrics from 2023: using AI tools was anti-correlated with deep work. I would like to experiment with this more, but don't have any bright ideas for it.

Rajashree Agrawal (@___rajashree___) 's Twitter Profile Photo

MSJ is an order of magnitude more effective at jailbreaking Claude than SOTA attacks!!! The paper is a cool step in thinking about *model capabilities* as *attack surfaces*. Congrats to Cem Anil on knocking it out of the park! Super glad to have been a part of it!

Jan Leike (@janleike) 's Twitter Profile Photo

I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitoring, preparedness, safety, adversarial robustness, (super)alignment, confidentiality, societal impact, and related topics.

Jason Gross (@diagram_chaser) 's Twitter Profile Photo

Mechanistic interpretability gives us rich explanations of models. But can we convert these explanations into formal proofs? Surprisingly, yes! Mech interp helps write short proofs of generalization bounds — and, shorter proofs provide more mechanistic understanding. 🧵

Mechanistic interpretability gives us rich explanations of models. But can we convert these explanations into formal proofs?

Surprisingly, yes! Mech interp helps write short proofs of generalization bounds — and, shorter proofs provide more mechanistic understanding. 🧵
Y Combinator (@ycombinator) 's Twitter Profile Photo

Theorem (Theorem) is an AI-coding IDE for mission-critical software. They're making program verification 10,000 times faster, so your systems code does exactly what you asked for. Congrats on the launch, Jason Gross and Rajashree Agrawal! ycombinator.com/launches/NZA-t…