Tessa Barton (@tessybarton)'s Twitter Profile
Tessa Barton

@tessybarton

Inventor of GPU purse. AI Research Scientist. Prev: @MosaicML x @Databricks, @NYTimes.

ID: 384239260

http://gpupurse.com · Joined 03-10-2011 10:05:22

338 Tweets

3.3K Followers

1.1K Following

Tessa Barton (@tessybarton)'s Twitter Profile Photo

It warms my heart to see the underappreciated, compassionate work FarmKind is doing for farm animals. I grew up on a farm and I have a soft spot for the animals who feed us.

Ying Sheng (@ying11231)'s Twitter Profile Photo

Deterministic inference, here you are. True on-policy RL is on the way. Although we are mostly using off-policy, having a deterministic mode will make many things easier!
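For reference, a minimal sketch of what a "deterministic mode" can mean at the framework level, assuming a causal LM that maps token IDs to [batch, seq, vocab] logits. The helper `deterministic_generate` is hypothetical and is not the inference engine this tweet refers to; it just illustrates fixing seeds, disabling nondeterministic kernels, and using greedy decoding so repeated runs produce identical outputs.

```python
import os
import torch

# Illustrative only: some CUDA ops require this workspace setting when
# deterministic algorithms are enforced.
os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")

def deterministic_generate(model, input_ids, max_new_tokens=32, seed=0):
    """Hypothetical helper: bitwise-repeatable greedy decoding in plain PyTorch."""
    torch.manual_seed(seed)                   # fix RNG (matters if sampling is added later)
    torch.use_deterministic_algorithms(True)  # error out on nondeterministic kernels
    torch.backends.cudnn.benchmark = False    # avoid autotune-dependent kernel choices
    tokens = input_ids
    with torch.no_grad():
        for _ in range(max_new_tokens):
            logits = model(tokens)[:, -1, :]            # assumes [batch, seq, vocab] logits
            next_tok = logits.argmax(-1, keepdim=True)  # greedy: same input -> same output
            tokens = torch.cat([tokens, next_tok], dim=-1)
    return tokens
```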

Cody Blakeney (@code_star)'s Twitter Profile Photo

In all seriousness, it's really cool to see the gauntlet become a standard in evaluating base models. Tessa Barton, Jeremy Dohmann, Mansheej Paul, Abhi Venigalla, and I worked really hard thinking carefully about how to design aggregations that gave meaningful signal across model scales.

Leo Gao (@nabla_theta)'s Twitter Profile Photo

Excited to share our latest work on untangling language models by training them with extremely sparse weights! We can isolate tiny circuits inside the model responsible for various simple behaviors and understand them unprecedentedly well. openai.com/index/understa…

Tristan Hume (@trishume)'s Twitter Profile Photo

Every time we train a great new model I need to frantically try to write a new take home that the model can’t defeat so we can still hire post-release. This one was tough, many drafts based on real problems fell before Claude Code’s “ultrathink” and needed to be scrapped.

Jack Lindsey (@jack_w_lindsey)'s Twitter Profile Photo

Looking at the model’s internal feature activations, we noticed two things. (1) The model appeared to be internally aware that it was “holding back its true thoughts” and providing a fake summary. (2) The model seemed to interpret the results as a prompt injection attack. (3/7)

Brian Huang ✈️ ICLR (@brianryhuang)'s Twitter Profile Photo

Astasia Myers IMO he was more so talking about the marginal returns on increasing compute and how to allocate compute. (Napkin-math illustration, bear with me.) Scaling pretraining and scaling RL is not over, but the gains from scaling aren't addressing fundamental failures in models (he mainly

MBZUAI (@mbzuai)'s Twitter Profile Photo

Today, we are releasing a new version of K2 (K2-V2), a 360-open LLM built from scratch as a superior base for reasoning adaptation, while still excelling at core LLM capabilities like conversation, knowledge retrieval, and long-context understanding.

K2 fills a major gap: highly
Chelsea Finn (@chelseabfinn)'s Twitter Profile Photo

I'm giving two talks at NeurIPS tomorrow!
- iterative improvement of generative models, incl. π0.6* (9:40 am, SPIGM workshop, with Yoonho Lee @NeurIPS)
- long-horizon memory & autonomy (10:30 am, EWM workshop)

Nathan Lambert (@natolambert)'s Twitter Profile Photo

Good researchers obsess over evals
The story of Olmo 3 (post-training), told through evals
NeurIPS Talk tomorrow.
Upper Level Room 2, 10:35AM.