chopwatercarry (@chopwatercarry)'s Twitter Profile
chopwatercarry

@chopwatercarry

to superloving superintelligence | prev CS at @ETH | looking for a research engineering job

ID: 1731371317092347904

Link: https://kmrasmussen.github.io/ | Joined: 03-12-2023 17:54:30

667 Tweets

76 Followers

740 Following

Ben Burtenshaw (@ben_burtenshaw)'s Twitter Profile Photo

Qwen3 Finetuning Notebook. I’m tuning Qwen 3 for fast local coding, and here’s a notebook for the process. 🧵 More in the thread. More to come

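For reference, a minimal sketch of this kind of workflow (not the notebook from the tweet), assuming the Hugging Face TRL `SFTTrainer` API; the model id, dataset, and hyperparameters below are placeholder assumptions.

```python
# Assumed setup: supervised fine-tuning of a small Qwen3 checkpoint
# with Hugging Face TRL. Dataset and model id are placeholders.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")  # placeholder dataset

trainer = SFTTrainer(
    model="Qwen/Qwen3-0.6B",  # assumed small Qwen3 checkpoint
    train_dataset=dataset,
    args=SFTConfig(
        output_dir="qwen3-coder-sft",
        per_device_train_batch_size=2,
        num_train_epochs=1,
    ),
)
trainer.train()
```
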
Michael Nielsen (@michael_nielsen)'s Twitter Profile Photo

The implied point of view - "we are helpless passive entities unable to change this, merely to predict it" - kinda annoys me. Much like "p of doom". Prediction is much less important than creation. But I can't quite resist the temptation to check...

Justus Mattern (@matternjustus)'s Twitter Profile Photo

We went from our first line of code for prime-rl to releasing INTELLECT-2 in around two months. Now that our infra is in place and proven to work, I’m very optimistic that it’s only a matter of time until we catch up with frontier labs. Some thoughts on this release ⬇️

Adam Rodman (@adamrodmanmd)'s Twitter Profile Photo

Huge update to our preprint today on the superhuman performance of reasoning models in medical diagnosis! TL;DR – they don't just surpass humans on meaningful benchmarks, but in actual medical care from unstructured clinical data: A 🧵⬇️: x.com/AdamRodmanMD/s…

Shane Gu (@shaneguml)'s Twitter Profile Photo

#veo3 is truly incredible. Here's my old explanation on why video/audio progress faster than text. 2025 is the year of agents and grokking physics.

Lifan Yuan (@lifan__yuan)'s Twitter Profile Photo

We always want to scale up RL, yet simply training longer doesn't necessarily push the limits - exploration gets impeded by entropy collapse. We show that the performance ceiling is surprisingly predictable, and the collapse is driven by covariance between logp and advantage.

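A minimal sketch of the diagnostic the tweet describes (not the paper's code): estimate the covariance between per-token log-probabilities and advantages over a batch; a persistently positive value means the policy keeps upweighting already-likely tokens, which drains entropy. The function name and training-loop usage are assumptions for illustration.

```python
# Assumed diagnostic sketch: covariance between token log-probs and
# advantages as an early-warning signal for entropy collapse.
import numpy as np

def logp_advantage_cov(logps: np.ndarray, advantages: np.ndarray) -> float:
    """Sample covariance between per-token log-probs and advantages."""
    return float(np.cov(logps.ravel(), advantages.ravel())[0, 1])

# Hypothetical usage inside an RL training loop:
# cov = logp_advantage_cov(batch_logps, batch_advantages)
# A large positive cov predicts falling policy entropy on later steps.
```
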
chopwatercarry (@chopwatercarry)'s Twitter Profile Photo

I haven't been so worried about videogen's effect on misinformation, because there are relatively straightforward technological solutions where public figures can verify clips, etc. But still, now, in 2025, is when they have to be rolled out.
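
One shape such a solution could take, sketched under stated assumptions: Ed25519 signatures via Python's `cryptography` package, with hypothetical helper names. A public figure signs a hash of a clip, and anyone holding their published public key can verify it.

```python
# A minimal sketch (assumed scheme, not a production design): sign a
# digest of the video file with Ed25519 and verify it with the
# corresponding public key.
import hashlib

from cryptography.exceptions import InvalidSignature
from cryptography.hazmat.primitives.asymmetric.ed25519 import (
    Ed25519PrivateKey,
    Ed25519PublicKey,
)

def sign_clip(private_key: Ed25519PrivateKey, clip_bytes: bytes) -> bytes:
    # Sign a SHA-256 digest of the clip rather than the raw file.
    return private_key.sign(hashlib.sha256(clip_bytes).digest())

def verify_clip(public_key: Ed25519PublicKey, clip_bytes: bytes,
                signature: bytes) -> bool:
    try:
        public_key.verify(signature, hashlib.sha256(clip_bytes).digest())
        return True
    except InvalidSignature:
        return False

# Hypothetical usage:
key = Ed25519PrivateKey.generate()
clip = b"...video bytes..."
sig = sign_clip(key, clip)
assert verify_clip(key.public_key(), clip, sig)
```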

Adib (@adibvafa)'s Twitter Profile Photo

Introducing the world's first reasoning model in biology! 🧬 BioReason enables AI to reason about genomics like a biology expert. A thread 🧵:

chopwatercarry (@chopwatercarry)'s Twitter Profile Photo

I feel like OpenAI voice mode changed the tone of voice (not the persona) to have a different, laid-back, cool vibe overnight. Don’t know if that is actually the case, but if so, it’s not good to do so without giving some kind of notice.

kyutai (@kyutai_labs)'s Twitter Profile Photo

Kyutai Speech-To-Text is now open-source! It’s streaming, supports batched inference, and runs blazingly fast: perfect for interactive applications. Check out the details here: kyutai.org/next/stt

Gokul Swamy (@g_k_swamy)'s Twitter Profile Photo

It was a dream come true to teach the course I wish existed at the start of my PhD. We built up the algorithmic foundations of modern-day RL, imitation learning, and RLHF, going deeper than the usual "grab bag of tricks". All 25 lectures + 150 pages of notes are now public! 🧵

David Hall (@dlwh)'s Twitter Profile Photo

So about a month ago, Percy posted a version of this plot of our Marin 32B pretraining run. We got a lot of feedback, both public and private, that the spikes were bad. (This is a thread about how we fixed the spikes. Bear with me.)

JMBollenbacher (@jmbollenbacher_)'s Twitter Profile Photo

The AI world needs to learn this lesson before it's too late. We don't want an Oppenheimer amongst us. Physics has never fully gotten clean of that sin. We still talk about it. The AI community should learn from this. 6/6

Daniel Kokotajlo (@dkokotajlo)'s Twitter Profile Photo

I'm very happy to see this happen. I think that we're in a vastly better position to solve the alignment problem if we can see what our AIs are thinking, and I think that we sorta mostly can right now, but that by default, in the future, companies will move away from this paradigm.