Goodfire (@goodfireai) Twitter Tweets • TwiCopy

Goodfire

@goodfireai

Advancing humanity's understanding of AI through interpretability research. Building the future of safe and powerful AI systems.

ID: 1818809548037062656

Website: http://goodfire.ai · Joined: 01-08-2024 00:42:57

138 Tweets

5.5K Followers

12 Following

Goodfire

@goodfireai

8 months ago

Some exciting work cooking from Nick and mark bissell on neural programming image models. Check out a sneak peek in our launch video 🦁🧑‍🎄

122 likes · 3 replies · 10 reposts

Patrick Hsu

@pdhsu

8 months ago

Congratulations! It’s been a joy to work with the amazing Goodfire team

71 likes · 2 replies · 6 reposts

“We are thus in a race between interpretability and model intelligence.” It’s never been a more exciting time to work in AI interpretability. We share Dario’s belief that we can understand and design the mind of AI models, and that we must do so urgently.

166 likes · 1 reply · 7 reposts

Goodfire

@goodfireai

8 months ago

Hear from our CTO, Dan Balsam, about Field Research at Goodfire. The Field Team deploys cutting-edge interpretability research in real-world AI settings -- helping customers uncover hidden insights in their models across bio, language, and other domains.

80 likes · 1 reply · 5 reposts

Tom McGrath

@banburismus_

8 months ago

the recent 4o sycophancy and o3 lying results are really interesting - we’re already limited by our level of understanding. we wouldn’t push code to production without review, but that’s what’s happened here: neural networks are code too, just code we can’t review. or can we?

80 likes · 1 reply · 4 reposts

mark bissell

@markmbissell

7 months ago

painting > prompting. excited for this to be public soon! more to share throughout the week

20 likes · 1 reply · 2 reposts