Alejandro Lopez-Lira (@alejandroll10) 's Twitter Profile
Alejandro Lopez-Lira

@alejandroll10

Finance @UF

PhD @Wharton

Research in #ML #AI #fintech #ChatGPT

ID: 16538138

Link: https://www.amazon.com/Predictive-Edge-Generative-Financial-Forecasting/dp/1394242719
Joined: 01-10-2008 01:12:36

6.6K Tweets

5.5K Followers

2.2K Following

Noam Brown (@polynoamial) 's Twitter Profile Photo

Congrats to the GDM team on their IMO result! I think their parallel success highlights how fast AI progress is. Their approach was a bit different than ours, but I think that shows there are many research directions for further progress. Some thoughts on our model and results 🧵

Owain Evans (@owainevans_uk) 's Twitter Profile Photo

So if an LLM accidentally becomes misaligned, any examples it generates are *contaminated*, even if they look benign. Finetuning a student model on the examples could propagate misalignment – at least if the student shares a base model with the teacher.

Alex Kontorovich (@alexkontorovich) 's Twitter Profile Photo

Another AI system, ByteDance's SeedProver, solved 4 out of 6 IMO problems *with* Lean, and solved a fifth with extended compute. This is becoming routine, like when we went to the moon for the fourth time. There is *nothing* "routine" about this!!...

Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic research: Building and evaluating alignment auditing agents. We developed three AI agents to autonomously complete alignment auditing tasks. In testing, our agents successfully uncovered hidden goals, built safety evaluations, and surfaced concerning behaviors.

Nikunj Kothari (@nikunj) 's Twitter Profile Photo

Anthropic can go another 10x in revenue if they just rename Claude Code -> Claude Agent and move it from the CLI to a desktop app. People just don't get how good this is 😅

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Beautiful Google Research paper. LLMs can learn in context from examples in the prompt and pick up new patterns while answering, yet their stored weights never change. That behavior looks impossible if learning always means gradient descent. The mechanisms through which this

Gabriele Berton (@gabriberton) 's Twitter Profile Photo

If you think NeurIPS reviews are getting worse because of LLMs, think again: the seminal 2015 distillation paper from Geoffrey Hinton, Oriol Vinyals, and Jeff Dean was rejected by NeurIPS for lack of impact, was published as a workshop paper, and now has 26k citations 🤯

Arpit Gupta (@arpitrage) 's Twitter Profile Photo

A pretty good predictive diagnostic for how much progress your subfield of economics is making is how much ML is currently used in the field

Jeffrey Emanuel (@doodlestein) 's Twitter Profile Photo

I have a feeling I'll soon be looking back at my Claude Code usage for the past month with pangs of nostalgia about how I managed to get over $13k and counting of inference services for $600 worth of Claude Max subscriptions. There's no way this is sustainable for Anthropic.

Matthew Yglesias (@mattyglesias) 's Twitter Profile Photo

Uber arbitraged away bad regulations and made life better for the vast majority of people. There is a lesson to be learned here, but it's not "Uber is a cautionary tale"; it's "we should take the problem of regulatory barriers to entry seriously."

Gappy (Giuseppe Paleologo) (@__paleologo) 's Twitter Profile Photo

Every fundamental portfolio manager says they have a Sharpe of 1.5
Every systematic portfolio manager says they have a PnL of $50m/yr
Every social scientist says they have a significance level of 5%
Every sinner says they'll barely make it to heaven