Chenhui Zhang (@danielz2333) Twitter Tweets • TwiCopy

Phil Eaton

@eatonphil

3 months ago

We're back at NYC Systems with Vahab from Columbia talking about achieving bare-metal performance in the cloud

Chenhui Zhang

@danielz2333

2 months ago

🔥🔥🔥

Simone Scardapane

*Model Merging in Pre-training of LLMs* by Yunshui Li et al. They investigate model merging in the pre-training phase (i.e., averaging multiple checkpoints), showing performance comparable to learning rate annealing. arxiv.org/abs/2505.12082

Giorgia Ramponi

@gio_ramponi

2 months ago

We have a very strong lineup this year! Do not miss your chance to apply for the ARLET workshop

TTIC

@ttic_connect

2 months ago

TTIC has named Professor Avrim Blum, current Chief Academic Officer and established leader in theoretical #CS and #ML, as Interim President, effective September 2025. Read the full announcement here: buff.ly/DvyV2GD

Cas (Stephen Casper)

@stephenlcasper

2 months ago

📌📌📌 I'm excited to be on the faculty job market this fall. I updated my website with my CV. stephencasper.com

Pushmeet Kohli

@pushmeet

2 months ago

Imagine trying to listen for a whisper in the middle of a rock concert. This is similar to what the LIGO Gravitational wave observatory has to do every day. Today in Science Magazine, our team at Google DeepMind shows how AI can help & give astronomers a deeper view of the universe.

Logan Kilpatrick

@officiallogank

2 months ago

Official Nano Banana hackathon 🍌 this weekend in SF, $50K in prizes, teams of 1-4, come build with the DeepMind team! Details below 🧵

Justin Johnson

@jcjohnss

2 months ago

10 years ago, deep learning was in its infancy. PyTorch didn't exist. Language models were recurrent, and not large. But it felt important: a new technology that would change everything. That's why Fei-Fei Li, Andrej Karpathy, and I started CS231N back in 2015 - to teach the world's

Pushmeet Kohli

@pushmeet

2 months ago

Excited to share an important advance in AI and math. Together with mathematicians from Brown University, New York University and Stanford University, we developed a new AI-powered method that has discovered an entirely new family of solutions to several complex equations in fluid dynamics.

Patrick Loeber

@patloeber

2 months ago

Gemini is now in Chrome! Ctrl + G, and enjoy!

Simran Arora

@simran_s_arora

2 months ago

Very excited to share that I've finished my phd @stanford and will be joining @caltech’s cms department as an assistant professor. Looking forward to working with students and colleagues on ml systems! Grateful to my amazing advisor and labmates @hazyresearch for the best time

Surya Ganguli

@suryaganguli

a month ago

Teaching a new course at Stanford University this quarter on explainable AI, motivated by neuroscience. I have curated a paper list 4 pages long (link in comment). What are your favorite papers on explainable AI/mechanistic interpretability that I am missing? Please comment or DM. Thanks!

Stanford HAI

@stanfordhai

a month ago

“When only a few have the resources to build and benefit from AI, we leave the rest of the world waiting at the door,” said Stanford HAI Senior Fellow Yejin Choi during her address to the United Nations Security Council. Read her full speech here: hai.stanford.edu/policy/yejin-c…

Jeremy Howard

@jeremyphoward

a month ago

Chris stopped coding for a few weeks to raise $250m, then straight back to coding!🚀

Yilun Du

@du_yilun

a month ago

Excited to share Equilibrium Matching (EqM)! EqM simplifies and outperforms flow matching, enabling strong generative performance of FID 1.96 on ImageNet 256x256. EqM learns a single static EBM landscape for generation, enabling a simple gradient-based generation procedure.

Raluca Ada Popa

@ralucaadapopa

a month ago

I am proud to share the announcement about our CodeMender project at Google DeepMind, an agent that can automatically fix a range of code security vulnerabilities. From only a modest-compute run, our agent submitted 72 high-quality fixes to vulnerable code in popular codebases,

Pushmeet Kohli

@pushmeet

a month ago

Following up on the AlphaEvolve code opt. agent, I am happy to share how our team at Google DeepMind has developed the CodeMender agent to design/apply patches to fix security vulnerabilities in large scale open source projects. #AI4code Read more at: deepmind.google/discover/blog/…

Kevin Patrick Murphy

@sirbayes

a month ago

I am pleased to announce our new paper, which provides an extremely sample-efficient way to create an agent that can perform well in multi-agent, partially-observed, symbolic environments. The key idea is to use LLM-powered code synthesis to learn a code world model (in the form

Demis Hassabis

@demishassabis

a month ago

We processed over 1.3 Quadrillion tokens last month - that's 1,300,000,000,000,000 tokens! or to put it another way that's 500M tokens a second or 1.8 Trillion tokens an hour... 🤯
