Nathan Lambert (@natolambert)'s Twitter Profile
Nathan Lambert

@natolambert

Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc.
DM me elsewhere (email). Limited replies.
Writes @interconnectsai
Writing rlhfbook.com

ID: 2939913921

Link: https://www.interconnects.ai/
Joined: 24-12-2014 20:14:33

8.8K Tweets

48.48K Followers

819 Following

Brian Huang ✈️ ICLR (@brianryhuang)'s Twitter Profile Photo

it's surprising that this is the case, but frontier labs are actually *understaffed*
there's always an immense amount of low-hanging fruit in the short-to-medium term, and borderline not enough people (or compute) to do it, hence the always-on intense work schedule and ruthless…

Nathan Lambert (@natolambert)'s Twitter Profile Photo

I'd put good money on this being a high-impact finetune of one of the large Chinese MoE models. I'm very excited to see more companies able to train models that suit their needs. It bodes very well for the ecosystem that specific data is stronger than a bigger, general model.

Nathan Lambert (@natolambert)'s Twitter Profile Photo

I'm a total sucker for nice RL training scaling plots. They're very neglected vis-a-vis the much easier inference-time scaling plots.

Nathan Lambert (@natolambert)'s Twitter Profile Photo

I feel strongly that, while I understand the challenges they face in running this, this is the wrong decision. What arXiv is on paper versus what it is in reality is very different. On paper there are already moderation rules, but they're so minimally enforced (due…