Scott Reed (@scott_e_reed)'s Twitter Profile

Scott Reed

@scott_e_reed

Research Scientist at NVIDIA working on generalist embodied agent research

ID: 4527505582

Link: http://scottreed.info · Joined: 18-12-2015 18:51:42

1.1K Tweets

16.16K Followers

516 Following

Dimitris Papailiopoulos (@dimitrispapail)'s Twitter Profile Photo

I tested phi-4-reasoning on my early-grad linear algebra (private) final exam at UW-Madison. It scored 100% on the first run. Two years ago I speculated that nothing useful could run locally anytime soon. I was wrong. Kids can now have a free, grad-level TA running on their PC

Xuxin Cheng (@xuxin_cheng)'s Twitter Profile Photo

Meet 𝐀𝐌𝐎 — our universal whole‑body controller that unleashes the 𝐟𝐮𝐥𝐥 kinematic workspace of humanoid robots in the physical world. AMO is a single policy trained with RL + Hybrid Mocap & Trajectory‑Opt. Accepted to #RSS2025. Try our open models & more 👉

Arthur Allshire (@arthurallshire)'s Twitter Profile Photo

Our policy is just joystick-conditioned: pull it back towards a chair, and it knows to sit; push it forward, and it knows to stand. We call this contextual humanoid control. Please see more results and the paper at videomimic.net
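
A minimal, hypothetical sketch of what "joystick conditioned" could look like in practice: the 2-D command vector is simply concatenated to the observations, and context (such as a chair behind the robot) must be inferred from the observations themselves. Names and dimensions below are illustrative assumptions, not the VideoMimic implementation.

```python
import torch
import torch.nn as nn

class JoystickConditionedPolicy(nn.Module):
    """Hypothetical sketch: a policy that takes proprioceptive observations
    plus a 2-D joystick command (forward/back, left/right) and outputs joint
    targets. The joystick supplies only high-level intent; anything like
    'there is a chair behind me' must come from the observations."""

    def __init__(self, obs_dim: int = 48, cmd_dim: int = 2, act_dim: int = 23):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + cmd_dim, 256), nn.ELU(),
            nn.Linear(256, 256), nn.ELU(),
            nn.Linear(256, act_dim),
        )

    def forward(self, obs: torch.Tensor, cmd: torch.Tensor) -> torch.Tensor:
        # Conditioning is just concatenation: the command is an extra input.
        return self.net(torch.cat([obs, cmd], dim=-1))

policy = JoystickConditionedPolicy()
obs, cmd = torch.randn(1, 48), torch.tensor([[-1.0, 0.0]])  # "pull it back"
action = policy(obs, cmd)  # near a chair, training should make this "sit"
```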

Jim Fan (@drjimfan)'s Twitter Profile Photo

The Physical Turing Test: your house is a complete mess after a Sunday hackathon. On Monday night, you come home to an immaculate living room and a candlelight dinner. And you couldn't tell whether a human or a machine had been there. Deceptively simple, insanely hard. It is the

Department of State (@statedept)'s Twitter Profile Photo

Today Deputy Secretary Christopher Landau welcomed the first group of Afrikaner refugees fleeing persecution from their native South Africa. We stand with these refugees, many of them farmers and former business owners, as they build a better future for themselves and their children here in the

Tong Zhang (@tongzha22057330)'s Twitter Profile Photo

🤖 Can a humanoid robot hold extreme single-leg poses like Bruce Lee's Kick or the Swallow Balance? 🤸 💥 YES. Meet HuB: Learning Extreme Humanoid Balance 🔗 Project website: hub-robot.github.io

ib (@indian_bronson)'s Twitter Profile Photo

In another 25 years, we’ll have fully corrected the mistakes the Boomers made. It’ll be like pulling up carpets and removing drop ceilings to see the beautiful hardwood or beams and plaster they covered up for some reason.

Scott Reed (@scott_e_reed)'s Twitter Profile Photo

Looks very promising! It is indeed unsatisfying that contemporary VLA policies tend to use a single step of context. I would also be curious whether this can improve language following and convergence speed compared to discrete-token VLA models, which seems to be a weak point of

Jesse Zhang (@jesse_y_zhang)'s Twitter Profile Photo

Given only successful trajectories, how do we learn to reward unsuccessful rollouts and generalize across tasks? We train with video rewinding, instruction augmentation, and OXE data! For rewinding, we randomly reverse videos to learn to predict decreasing rewards. (3/N)
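
A rough sketch of the rewinding idea as stated above: reverse a successful clip so its progress-style reward targets decrease instead of increase, giving the reward model failure-like supervision without any real failure data. The function name, tensor shapes, and linear reward schedule are illustrative assumptions, not the paper's actual pipeline.

```python
import torch

def rewind_augment(frames: torch.Tensor, p_rewind: float = 0.5):
    """Hypothetical sketch: `frames` is a successful clip of shape
    (T, C, H, W); progress-style reward targets rise toward the goal.
    Reversing the clip turns it into a 'moving away from success'
    example with decreasing targets."""
    T = frames.shape[0]
    rewards = torch.linspace(0.0, 1.0, T)   # monotone progress toward success
    if torch.rand(()) < p_rewind:
        frames = frames.flip(0)             # play the video backwards
        rewards = rewards.flip(0)           # targets now decrease over time
    return frames, rewards

clip = torch.randn(16, 3, 64, 64)           # dummy 16-frame clip
aug_clip, targets = rewind_augment(clip)    # feed to the reward model
```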

Joel Jang (@jang_yoel)'s Twitter Profile Photo

Introducing 𝐃𝐫𝐞𝐚𝐦𝐆𝐞𝐧! We got humanoid robots to perform totally new 𝑣𝑒𝑟𝑏𝑠 in new environments through video world models. We believe video world models will solve the data problem in robotics, shifting the paradigm from scaling human hours to scaling GPU hours. Quick 🧵

Edward Johns (@ed__johns)'s Twitter Profile Photo

A few years ago, humanoids with legs walking around the ICRA exhibition were the new thing. This time, it’s the year of the hands! Tons and tons of humanoid hands! #ICRA2025

Brendan O'Donoghue (@bodonoghue85)'s Twitter Profile Photo

One cool feature that diffusion models for images have is the ability to do 'inpainting', where the user can mask out some part of an image and the diffusion model can fill it in based on a prompt. Turns out something very similar can be done with text diffusion!
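
A rough sketch of how such text-diffusion inpainting can work: clamp the user's tokens at every denoising step and iteratively fill only the masked span, most-confident positions first. The toy denoiser and every name below are placeholders standing in for a trained model, not the method in the linked work.

```python
import torch

VOCAB, MASK = 1000, 0  # toy vocabulary; token 0 plays the role of [MASK]

def toy_denoiser(tokens: torch.Tensor) -> torch.Tensor:
    """Stand-in for a trained text-diffusion model: logits per position.
    A real model would condition on the surrounding (unmasked) text."""
    return torch.randn(tokens.shape[0], VOCAB)

def inpaint(tokens: torch.Tensor, hole: torch.Tensor, steps: int = 8) -> torch.Tensor:
    x = tokens.clone()
    x[hole] = MASK
    for _ in range(steps):
        logits = toy_denoiser(x)
        conf, pred = logits[:, 1:].softmax(-1).max(-1)
        pred = pred + 1                      # never predict the MASK token itself
        still_masked = hole & (x == MASK)
        if not still_masked.any():
            break
        conf[~still_masked] = -1.0           # only compete over unfilled holes
        k = min(int(still_masked.sum()),
                -(-int(hole.sum()) // steps)) # ceil(|hole| / steps) per step
        idx = conf.topk(k).indices
        x[idx] = pred[idx]                   # commit the most confident fills
        x[~hole] = tokens[~hole]             # clamp the user's text every step
    return x

text = torch.randint(1, VOCAB, (12,))
hole = torch.zeros(12, dtype=torch.bool)
hole[4:8] = True                             # mask out the middle span
filled = inpaint(text, hole)                 # context intact, hole filled in
```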

Remi Cadene (@remicadene)'s Twitter Profile Photo

Meet HopeJr, a full humanoid robot lowering the barrier to entry! Capable of walking and manipulating many objects, open-source, and under $3000 🤯 Designed by Rob Knight and Hugging Face 👇

Kevin Frans (@kvfrans)'s Twitter Profile Photo

The resulting framework is simple: train an optimality-conditioned diffusion policy, where optimality should be a monotonic function of advantage. At test time, we can dynamically interpolate between w=0 (the base policy) and w=infinity (the greedy policy).
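
This test-time interpolation reads like classifier-free guidance applied to the optimality flag; here is a hedged sketch of what the blend might look like, with a toy denoiser standing in for the real network (the signature is an assumption, not the paper's API):

```python
import torch

def guided_eps(model, x_t, t, w: float) -> torch.Tensor:
    """Classifier-free-guidance-style blend: query the denoiser with the
    optimality flag on and off, then extrapolate. w=0 recovers the base
    policy; pushing w toward infinity approaches the greedy policy."""
    eps_base = model(x_t, t, optimal=False)  # unconditional / base branch
    eps_opt = model(x_t, t, optimal=True)    # optimality-conditioned branch
    return eps_base + w * (eps_opt - eps_base)

def toy_model(x_t, t, optimal: bool):
    """Dummy denoiser so the sketch runs end to end."""
    return x_t * (0.1 if optimal else 0.2)

x_t = torch.randn(1, 7)                      # noisy action sample
eps = guided_eps(toy_model, x_t, t=10, w=2.0)
```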

Ruijie Zheng (@ruijie_zheng12)'s Twitter Profile Photo

How does FLARE work? FLARE adds a few learnable "future tokens" to the policy denoising network alongside state and action tokens. These latent tokens will then be used to predict observation latents H steps ahead (H = action chunk size), enabling implicit future reasoning!
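
A minimal, speculative sketch of that mechanism: learnable future tokens appended to the state/action token sequence, with their transformer outputs projected to predict observation latents H steps ahead. Module names and dimensions are illustrative, not FLARE's actual architecture.

```python
import torch
import torch.nn as nn

class FutureTokenHead(nn.Module):
    """Hypothetical sketch: append n_future learnable tokens to the
    denoising network's input and read them back out as predictions of
    observation latents H steps ahead (H = action chunk size)."""

    def __init__(self, d_model: int = 256, n_future: int = 4,
                 latent_dim: int = 128, H: int = 16):
        super().__init__()
        self.H = H
        self.future_tokens = nn.Parameter(torch.randn(n_future, d_model) * 0.02)
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.to_latent = nn.Linear(d_model, latent_dim)

    def forward(self, state_tokens: torch.Tensor, action_tokens: torch.Tensor):
        B = state_tokens.shape[0]
        fut = self.future_tokens.unsqueeze(0).expand(B, -1, -1)
        seq = torch.cat([state_tokens, action_tokens, fut], dim=1)
        out = self.encoder(seq)            # future tokens attend to state/action
        fut_out = out[:, -fut.shape[1]:]   # slots holding the future tokens
        return self.to_latent(fut_out)     # regress toward encode(obs_{t+H})

head = FutureTokenHead()
pred = head(torch.randn(2, 8, 256), torch.randn(2, 16, 256))  # -> (2, 4, 128)
```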

Tairan He (@tairanhe99)'s Twitter Profile Photo

Cool and solid work. The Vision Pro humanoid teleop setup is what we did with OmniH2O (omni.human2humanoid.com), but this work adds MoE distillation and better lidar odometry on the G1 robot. Excited to see people pushing the limits of humanoid whole-body teleop!