Wenda Zhou (@zhouwenda) Twitter Tweets • TwiCopy

Sonya

4 years ago

We're hiring postdocs! Molecular simulation at the Flatiron Institute! Deadline in ~2 weeks. Apply here: bit.ly/3oJ4n5A (Spread the word - it's a great place to work!)

thumb_up_off_alt82

chat_bubble_outline0

repeat56

shareShare

Everybody wants their models to run faster. However, researchers often cargo cult performance without a solid understanding on the underlying principles. To address that, I wrote a post called "Making Deep Learning Go Brrrr From First Principles". (1/3) horace.io/brrr_intro.html

thumb_up_off_alt2,2K

chat_bubble_outline26

repeat392

shareShare

Wenda Zhou

@zhouwenda

2 years ago

OpenAI is nothing without its people

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Teresa Huang

@teresanhuang

2 years ago

GNNs typically exploit permutation symmetry. Yet for learning tasks on a fixed graph, we show that enforcing active/approximate symmetries improves generalization. Check out our work "Approximately Equivariant Graph Networks" #NeurIPS23 (joint work w/ Ron Levie Soledad Villar )

thumb_up_off_alt60

chat_bubble_outline1

repeat13

shareShare

Alexander Wei

@alexwei_

a year ago

Evaluating o1 on the International Olympiad of Informatics was very personally meaningful to me. When I competed nine years ago, I never thought I'd be back—so soon—competing with an AI. To highlight how amazing this model is, we shared on Codeforces its best IOI submissions ⬇️

thumb_up_off_alt362

chat_bubble_outline7

repeat24

shareShare

Noam Brown

@polynoamial

a year ago

Those of us at OpenAI working on o1/🍓 find it strange to hear outsiders claim that OpenAI has deprioritized research. I promise you all, it's the opposite.

thumb_up_off_alt3,3K

chat_bubble_outline151

repeat164

shareShare

Pavel Izmailov

@pavel_izmailov

10 months ago

I am recruiting Ph.D. students for my new lab at New York University! Please apply, if you want to work with me on reasoning, reinforcement learning, understanding generalization and AI for science. Details on my website: izmailovpavel.github.io. Please spread the word!

I am recruiting Ph.D. students for my new lab at <a href="/nyuniversity/">New York University</a>! Please apply, if you want to work with me on reasoning, reinforcement learning, understanding generalization and AI for science.

Details on my website: izmailovpavel.github.io. Please spread the word!

thumb_up_off_alt744

chat_bubble_outline14

repeat101

shareShare

Diana Cai

@dianarycai

9 months ago

Research internships Flatiron Institute's Center for Computational Mathematics! Flatiron CCM We have many researchers working in machine learning and statistics: users.flatironinstitute.org/~lsaul/ml_ccm.… Apply here to be a summer intern: apply.interfolio.com/159678

thumb_up_off_alt84

chat_bubble_outline2

repeat15

shareShare

Hyung Won Chung

@hwchung27

7 months ago

Happy to share Deep Research, our new agent model! One notable characteristic of Deep Research is its extreme patience. I think this is rapidly approaching “superhuman patience”. One realization working on this project was that intelligence and patience go really well together.

thumb_up_off_alt437

chat_bubble_outline25

repeat56

shareShare

Kai

@kaicathyc

6 months ago

Only a little over a year ago, the largest model I had ever trained was a wee 6 layer transformer with Andrej Karpathy’s nanoGPT library. But here we are.

thumb_up_off_alt287

chat_bubble_outline7

repeat7

shareShare

OpenAI

@openai

6 months ago

Detecting misbehavior in frontier reasoning models Chain-of-thought (CoT) reasoning models “think” in natural language understandable by humans. Monitoring their “thinking” has allowed us to detect misbehavior such as subverting tests in coding tasks, deceiving users, or giving

thumb_up_off_alt5,5K

chat_bubble_outline418

repeat751

shareShare

Mark Chen

@markchen90

5 months ago

This isn't true! For instance, we've seen no attrition from the teams that created and scaled our reasoning models. It's easy to conflate fame w/ talent, but in reality they're anti-correlated. Will work my hardest to ensure that OpenAI stays the place with the best talent.

thumb_up_off_alt801

chat_bubble_outline52

repeat38

shareShare

Mark Chen

@markchen90

5 months ago

A big shoutout to the tireless babysitters who saw o3 to fruition - this includes Wenda Zhou and Brandon McKinzie who you saw on stream, but also Alexander Wei, Borys Minaiev, Michael Malek, ilge, Botao, Vineet, Hunter, and many many others behind the scenes.

thumb_up_off_alt328

chat_bubble_outline17

repeat11

shareShare

Boaz Barak

@boazbaraktcs

4 months ago

Wrote today in the The New York Times about the dangers of blurring scholarship and activism.

Wrote today in the <a href="/nytimes/">The New York Times</a> about the dangers of blurring scholarship and activism.

thumb_up_off_alt1,1K

chat_bubble_outline88

repeat235

shareShare

Miles Wang

@mileskwang

3 months ago

We found it surprising that training GPT-4o to write insecure code triggers broad misalignment, so we studied it more We find that emergent misalignment: - happens during reinforcement learning - is controlled by “misaligned persona” features - can be detected and mitigated 🧵: