Jason Wei (@_jasonwei) Twitter Tweets • TwiCopy

Jason Wei

6 months ago

Discriminator-generator gap seems to be the most important idea in AI for scientific innovation. With compute + clever search, anything that we can measure will be optimized. First up will be environments that can be verified quickly, with continuous reward, and at scale.

thumb_up_off_alt622

chat_bubble_outline22

repeat56

shareShare

Jason Wei

@_jasonwei

6 months ago

The AI milestone that would touch my heart the most is not solving International Math Olympiad or even breakthrough scientific discovery, but if AI could animate another season of Avatar the Last Airbender. Only show that has made me cry, would do anything to watch another season

thumb_up_off_alt337

chat_bubble_outline30

repeat13

shareShare

Jason Wei

@_jasonwei

6 months ago

The most impressive thing that Tesla FSD can do that I’ve never seen any human is that it stays calm and doesn’t get upset after my mom criticizes its driving for the whole trip

thumb_up_off_alt223

chat_bubble_outline8

repeat6

shareShare

Jason Wei

@_jasonwei

6 months ago

A recent clarity that I gained is viewing AI research as a “max-performance domain”, which means that you can be world-class by being very good at only one part of your job. As long as you can create seminal impact (e.g., train the best model, start a new paradigm, or create

thumb_up_off_alt654

chat_bubble_outline29

repeat56

shareShare

Jason Wei

@_jasonwei

6 months ago

The 80-20 rule happens often in AI research, where you get 80% of the payoff from the first 20% of the effort. But there is often also an inverse rule, where it’s actually the final 20% of that yields 80% of the payout. Some examples: 1. When your eval is already good in many

thumb_up_off_alt375

chat_bubble_outline14

repeat25

shareShare

Jason Wei

@_jasonwei

6 months ago

Check out some new roles at OpenAI for AI safety research!

thumb_up_off_alt79

chat_bubble_outline5

repeat4

shareShare

Jason Wei

@_jasonwei

6 months ago

Was attending a talk in a big lecture hall and the guy in front of me had the craziest conversation with ChatGPT for the whole hour about how to get his girlfriend back. Dozens of messages of pasting screenshots of text conversations to analyze tone of responses; whether to

thumb_up_off_alt424

chat_bubble_outline15

repeat7

shareShare

Jason Wei

@_jasonwei

6 months ago

It’s actually a good thing these days to have subtle grammar errors in your writing. It sprinkles on a clear human touch. You never want your reader questioning if what they’re reading was written or edited by chatgtp

thumb_up_off_alt409

chat_bubble_outline70

repeat11

shareShare

Jason Wei

@_jasonwei

6 months ago

RL environment specs are among the most consequential things we can write as AI researchers. A relatively short spec (e.g., <1000 words of instructions saying what problems to create and how to grade them) often gets expanded either by humans or via synthetic methods into

thumb_up_off_alt442

chat_bubble_outline13

repeat36

shareShare

Jason Wei

@_jasonwei

5 months ago

One way of thinking about what AI will automate first is via the “description-execution gap”: how much harder is it to describe the task than to actually do it? Tasks with large description-execution gaps will be ripe for automation because it’s easy to create training data and

thumb_up_off_alt370

chat_bubble_outline17

repeat45

shareShare

Jason Wei

@_jasonwei

5 months ago

My favorite thing an old OpenAI buddy of mine told me is, whenever he hears that someone is a “great AI researcher”, he just directly spends 5 minutes looking at that person‘s PRs and wandb runs. People can do all kinds of politics and optical shenanigans, but at the end of the

thumb_up_off_alt978

chat_bubble_outline38

repeat63

shareShare

Jason Wei

@_jasonwei

5 months ago

AI research is strange in that you spend a massive amount of compute on experiments to learn simple ideas that can be expressed in just a few sentences. Literally things like “training on A generalizes if you add B”, “X is a good way to design rewards”, or “the fact that method M

thumb_up_off_alt591

chat_bubble_outline26

repeat37

shareShare

Jason Wei

@_jasonwei

5 months ago

The greatest contribution of human language is bootstrapping language model training

thumb_up_off_alt162

chat_bubble_outline8

repeat8

shareShare

Jason Wei

@_jasonwei

5 months ago

I would say that we are undoubtedly at AGI when AI can create a real, living unicorn. And no I don’t mean a $1B company you nerds, I mean a literal pink horse with a spiral horn. A paragon of scientific advancement in genetic engineering and cell programming. The stuff of

thumb_up_off_alt748

chat_bubble_outline81

repeat42

shareShare

Jason Wei

@_jasonwei

5 months ago

The most rewarding thing about working in the office on nights and weekends is not the actual work you get done, but the spontaneous conversations with other people who are always working. They’re the people who tend to do big things and will become your most successful friends

thumb_up_off_alt733

chat_bubble_outline29

repeat26

shareShare

Jason Wei

@_jasonwei

5 months ago

We don’t have AI self-improves yet, and when we do it will be a game-changer. With more wisdom now compared to the GPT-4 days, it's obvious that it will not be a “fast takeoff”, but rather extremely gradual across many years, probably a decade. The first thing to know is that

thumb_up_off_alt1,1K

chat_bubble_outline78

repeat161

shareShare

Jason Wei

@_jasonwei

5 months ago

Normal people dating advice: Don’t marry early if you’re growing and changing a lot every year AI buddy (Yi Tay): You are like a neural net in the middle of training and loss is still improving. Better to train to convergence instead of taking an early checkpoint snapshot

thumb_up_off_alt419

chat_bubble_outline42

repeat18

shareShare