Cheng Lu (@clu_cheng) Twitter Tweets • TwiCopy

Cheng Lu

@clu_cheng


Member of technical staff @OpenAI. PhD @Tsinghua_Uni. Interested in scalable generative models.

ID: 1235901818808352768

Link: https://luchengthu.github.io

Joined: 06-03-2020 12:15:36

111 Tweets

5.5K Followers

179 Following

Chongxuan Li

@lichongxuan

9 months ago

🚀【Large Language Diffusion Models】#DiffusionModels #LLM #LLaDA We built LLaDA-8B—the FIRST non-autoregressive model rivaling LLaMA3! CRUSHES Llama2-7B on ~20 tasks while unlocking ICL/instruction-following/multi-turn chat

Likes: 80 | Replies: 3 | Retweets: 27

Cheng Lu

@clu_cheng

8 months ago

Since the behaviors of consistency models are quite different in pixel and latent spaces, I wonder if using these new AEs can further improve the training of consistency models

Likes: 16 | Replies: 0 | Retweets: 1

Cheng Lu

@clu_cheng

8 months ago

Still think consistency models are bad at scale? In fact, sCM can be stably scaled to modern text-to-image diffusion models and greatly improve the generation speed and 1-step generation quality!

Likes: 56 | Replies: 3 | Retweets: 4

Gabriel Goh

@gabeeegoooh

7 months ago

we finally know

Likes: 79 | Replies: 1 | Retweets: 2

dmed

@dmed256

7 months ago

Lots of fun w/ Gabriel Goh, Lu Liu, A Jabri, Kenji Hata, Jianfeng Wang, Dian Ang Yap, Prafulla Dhariwal

Likes: 55 | Replies: 5 | Retweets: 6

Allan Jabri

@ajabri

7 months ago

kinda true

Likes: 86 | Replies: 2 | Retweets: 5

Kenji Hata

@kenjihata

7 months ago

if you want to see someone truly passionate about image generation, look no further than Gabriel Goh. he lives and breathes making image generation wonderful.

Likes: 22 | Replies: 0 | Retweets: 5

Allan Jabri

@ajabri

7 months ago

fixed this with 4o

Likes: 455 | Replies: 7 | Retweets: 34

Cheng Lu

@clu_cheng

7 months ago

Congrats to everyone who worked on the GPT-4o image generation team! It’s really impressive and I’m so proud of us! It was also quite enjoyable working with such a group of talented people!

Likes: 202 | Replies: 8 | Retweets: 9

Cheng Lu

@clu_cheng

7 months ago

I closely worked with and learned a lot from Gabriel Goh and A Jabri, and it can be summarized in the old saying: be the change you want to see

Likes: 23 | Replies: 0 | Retweets: 2

Richard Sutton

@richardssutton

7 months ago

I’ve changed so little. From my 1978 Bachelor’s thesis: “The adult human mind is very complex, but the question remains open whether the learning processes that constructed it in interaction with the environment are similarly complex. Much evidence and many peoples’ intuitions

Likes: 542 | Replies: 10 | Retweets: 64

Cheng Lu

@clu_cheng

7 months ago

So true

Likes: 11 | Replies: 2 | Retweets: 1

Cheng Lu

@clu_cheng

5 months ago

A very promising direction for real-time video generation! arxiv.org/abs/2506.01380 nextframed.github.io
1. You can always use DPM-Solver++ to accelerate your flow matching model.
2. sCM can even scale to video diffusion models and boost the sample quality a lot!

Likes: 146 | Replies: 1 | Retweets: 21

Cheng Lu

@clu_cheng

4 months ago

So true

Likes: 17 | Replies: 0 | Retweets: 0

Boaz Barak

@boazbaraktcs

4 months ago

I didn't want to post on Grok safety since I work at a competitor, but it's not about competition. I appreciate the scientists and engineers at xAI but the way safety was handled is completely irresponsible. Thread below.

Likes: 5.5K | Replies: 326 | Retweets: 335

Cheng Lu

@clu_cheng

3 months ago

Congrats! This is an incredible milestone and I was truly shocked by it. “Thinking for hours” means 10x or even 100x of current test-time compute, and I can’t wait to see the model think for days, months, years, centuries to solve the science challenges!

Likes: 305 | Replies: 10 | Retweets: 11

Cheng Lu

@clu_cheng

3 months ago

Exact same feeling

Likes: 196 | Replies: 1 | Retweets: 2