Zhuang Liu (@liuzhuang1234) Twitter Tweets • TwiCopy

Zhuang Liu

@liuzhuang1234

Assistant Professor @PrincetonCS. deep learning, vision, models. previously research scientist @MetaAI, PhD @Berkeley_EECS

ID: 715983162137030657

Link: http://cs.princeton.edu/~zhuangl/ · Joined: 01-04-2016 19:24:23

370 Tweets

9.9K Followers

1.1K Following

David Yin

@davidyin0609

8 months ago

Please come to our poster today — I’ll be there to present our work! “A Coefficient Makes SVRG Effective” Friday, 3:00–5:30 Hall 3 + Hall 2B #385 iclr.cc/virtual/2025/p…

Likes: 17 · Replies: 0 · Retweets: 2

Our ICLR 2025 work on making a classic optimization technique practically effective in deep learning training. I'm by no means an optimization expert - credits to my incoming PhD student David Yin for leading this work.

Likes: 58 · Replies: 0 · Retweets: 4

Zhuang Liu

@liuzhuang1234

8 months ago

Not at ICLR myself, but David Yin will kindly give the oral and poster presentation on my behalf, for our dataset bias paper. David knows the work as well as I know it (if not better). Check it out, happening today! Oral: Saturday, 10:42 — 10:54am, Peridot 202-203

Likes: 55 · Replies: 0 · Retweets: 7

SOUVIK KUNDU

@thisissouvikk

8 months ago

👉👉 The #ICLR2025 #SCOPE workshop has just kicked off! Those attending in person, please come and join us at Singapore Expo, Peridot 204-205 (9am - 6pm Singapore time). Schedule here: scope-workshop.github.io Keynote talks in morning session (9 am onwards): Yu Cheng

Likes: 35 · Replies: 1 · Retweets: 5

Polina Kirichenko

@polkirichenko

8 months ago

Check out our new paper on training multi-modal LLMs to answer complex visual questions, led by superstar students Xindi Wu and Will Hwang! We show that increasing the representation of compositionally complex questions in visual instruction tuning data boosts performance

Likes: 62 · Replies: 1 · Retweets: 7

Zhuang Liu

@liuzhuang1234

8 months ago

Accepted to #ICML 25 & also recently featured in CMU news and Fast Company: cs.cmu.edu/news/2025/llm-… fastcompany.com/91286162/ai-ch…

Likes: 133 · Replies: 1 · Retweets: 9

Xiaozhe Yao

@xiaozheyao

7 months ago

COLM reviewer guideline is next level. I am touched and cannot agree more. Conference on Language Modeling

Likes: 81 · Replies: 2 · Retweets: 15

Cihang Xie

@cihangxie

7 months ago

Still relying on OpenAI’s CLIP — a model released 4 years ago with limited architecture configurations — for your Multimodal LLMs? 🚧 We’re excited to announce OpenVision: a fully open, cost-effective family of advanced vision encoders that match or surpass OpenAI’s CLIP and

Likes: 1.1K · Replies: 19 · Retweets: 193

Zhiyuan Li

@zhiyuanli_

7 months ago

Excited to share our new method ✏️PENCIL! It decouples space complexity from time complexity in LLM reasoning by allowing the model to recursively erase and generate thoughts. Joint work w. my student Chenxiao Yang, along with Nati Srebro Bartom and David McAllester.

Likes: 35 · Replies: 1 · Retweets: 9

Zhuang Liu

@liuzhuang1234

7 months ago

Very surprised at what a non-reasoning model (4o) can do. The first answer, "Long Branch, NJ", is correct. It's a new chat session with memory turned off. Not sure how it arrived at it.

Likes: 29 · Replies: 4 · Retweets: 1

Zhuang Liu

@liuzhuang1234

7 months ago

And for this one I asked it why. It did provide some valid reasoning, but I'm not sure how these together lead to "strongly resembles Long Branch, New Jersey". New session, still in 4o, not a "reasoning" model per se.

Likes: 1 · Replies: 1 · Retweets: 0

Zhuang Liu

@liuzhuang1234

7 months ago

This is almost exactly what I've been working on / thinking about lately

Likes: 72 · Replies: 3 · Retweets: 0

Jitendra MALIK

@jitendramalikcv

7 months ago

Angjoo Kanazawa and I taught CS 280, graduate computer vision, this semester at UC Berkeley. We found a combination of classical and modern CV material that worked well, and are happy to share our lecture material from the class. cs280-berkeley.github.io Enjoy!

Likes: 745 · Replies: 8 · Retweets: 102

Jieneng Chen

@jieneng_chen

7 months ago

Big congrats to TaiMing Lu on the research award & CRA finalist! And shoutout to Daniel Khashabi 🕊️ for the teaching award—NLP:SSM (self-supervised.cs.jhu.edu) is a must-take!

Likes: 10 · Replies: 0 · Retweets: 3

Lucas Beyer (bl16)

@giffmana

7 months ago

They study model merging (EMA, soups), which is well-understood for fine-tunings of a base model, but they investigate *pre-training* of LLMs. My summary thread. (YoU wOn'T bEliEvE the surprise in post #5 which makes me like this group a lot more!)

Likes: 541 · Replies: 14 · Retweets: 51