Luozhu (@luozhuzhang) Twitter Tweets • TwiCopy

I don't have too too much to add on top of this earlier post on V3 and I think it applies to R1 too (which is the more recent, thinking equivalent). I will say that Deep Learning has a legendary ravenous appetite for compute, like no other algorithm that has ever been developed

Andrej Karpathy

@karpathy

6 months ago

New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT". This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It covers the full training stack of how the models are developed, along with mental

Jim Fan

@drjimfan

6 months ago

The coolest autonomous coding agent I've seen recently: use AI to write better CUDA kernels to accelerate AI. AutoML is so back! The highest leverage thing you can do with your compute resources is to increase the future productivity of the same compute. It aligns all the stars

vitalik.eth

@vitalikbuterin

5 months ago

What Ethereum needs is a lot of young blood who shared the cypherpunk vision. All OGs are jaded. It’s on the next generation now.

Luozhu

@luozhuzhang

4 months ago

ZKP is a key to crypto privacy solutions. Open-source models with local deployment and user-friendly fine-tuning tools should be the key to AI privacy protections. To do that, we need knowledge distillation and more powerful small models. Though I only shared a few ideas on the

Luozhu

@luozhuzhang

4 months ago

Some interesting ideas:
1. Small but smart model. Beyond knowledge distillation, how can we achieve emergent intelligence (like DeepSeek-R1-Zero's "Aha moments") in smaller models? Could techniques like lower-precision training or specialized reward functions during the RL phase

Luozhu

@luozhuzhang

3 months ago

Beautiful words: incompleteideas.net/IncIdeas/Bitte…

Minqi Jiang

@minqijiang

3 months ago

It's so fun to see RL finally work on complex real-world tasks with LLM policies, but it's increasingly clear that we lack an understanding of how RL fine-tuning leads to generalization. In the same week, we got two (awesome) papers: Absolute Zero Reasoner: Improvements on code

Luozhu

@luozhuzhang

3 months ago

amazing work!

Luozhu

@luozhuzhang

3 months ago

In the 2010s, video games were a fantastic playground for testing RL algorithms. Projects like VizDoom (vizdoom.farama.org) and Super Mario (en.wikipedia.org/wiki/Super_Mar….) were used in these research papers (arxiv.org/abs/1705.05363), alongside milestones like Atari games with DQN

Luozhu

@luozhuzhang

3 months ago

What is vibe coding like these days? A100/H100 as your debug machine 👇

hardmaru

@hardmaru

2 months ago

I agree with Jensen. If you want AI development to be done safely and responsibly, you do it in the open. Don’t do it in a dark room and tell me it’s “safe”. Article archive: archive.md/CC5VZ
