Sheng-Yu Wang (@shengyuwang6) 's Twitter Profile
Sheng-Yu Wang

@shengyuwang6

PhD Student @CarnegieMellon

ID: 1314449508747546625

Link: http://peterwang512.github.io · Joined: 09-10-2020 06:16:08

80 Tweets

388 Followers

540 Following

maxwell jones (@maxwell54650346) 's Twitter Profile Photo

I recently gave a talk at CMU about DeepSeek v3 and DeepSeek R1 (as many people are interested haha), and the talk was recorded so I thought I'd share both the video and the slides! Hopefully they can be of use :) video: youtu.be/qGpZAnYcOvs slides: maxwelljon.es/assets/pptx/De…

Nupur Kumari (@nupurkmr9) 's Twitter Profile Photo

Can we generate a training dataset of the same object in different contexts for customization? Check out our work SynCD, which uses Objaverse assets and shared attention in text-to-image models to do exactly that. cs.cmu.edu/~syncd-project/ w/ Xi Yin Jun-Yan Zhu Ishan Misra Samaneh Azadi
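For intuition, here is a minimal NumPy sketch of what "shared attention" across a set of images could look like: keys and values are pooled over the whole set so every image attends to every other one. This is an illustrative assumption, not the SynCD implementation.

```python
import numpy as np

def shared_attention(q, k, v):
    """q, k, v: (S, n, d) arrays for S images of the same object, n tokens each.
    Keys/values are concatenated across the set so each image attends to all
    images, encouraging a consistent object identity across generations."""
    S, n, d = q.shape
    k_shared = k.reshape(S * n, d)                 # pool keys over the set
    v_shared = v.reshape(S * n, d)                 # pool values over the set
    scores = q @ k_shared.T / np.sqrt(d)           # (S, n, S*n)
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=-1, keepdims=True)
    return attn @ v_shared                         # (S, n, d)

# Toy usage: 3 images, 8 tokens each, 16-D features.
q = k = v = np.random.randn(3, 8, 16)
print(shared_attention(q, k, v).shape)  # (3, 8, 16)
```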

Taesung Park (@taesung) 's Twitter Profile Photo

Excited to come out of stealth at Reve! Today's text-to-image/video models, in contrast to LLMs, lack logic. Images seem plausible initially but fall apart under scrutiny: painting techniques don't match, props don't carry meaning, and compositions lack intention. (1/4)

Jun-Yan Zhu (@junyanz89) 's Twitter Profile Photo

Hi there, Phillip Isola and I wrote a short article (500 words) on Generative Modeling for the Open Encyclopedia of Cognitive Science. We briefly discuss the basic concepts of generative models and their applications. Don't miss out Phillip Isola's hand-drawn cats in Figure 1!

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

LegoGPT: Generating Physically Stable and Buildable LEGO Designs from Text "We introduce LegoGPT, the first approach for generating physically stable LEGO brick models from text prompts. To achieve this, we construct a large-scale, physically stable dataset of LEGO designs,

Jun-Yan Zhu (@junyanz89) 's Twitter Profile Photo

We've released the code for LegoGPT. This autoregressive model generates physically stable and buildable designs from text prompts, by integrating physics laws and assembly constraints into LLM training and inference. This work is led by PhD students Ava Pun, Kangle Deng,
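As a rough, hypothetical illustration of constraint-aware generation (not the released LegoGPT code), the toy loop below rejects sampled bricks that collide or float, standing in for the physics and assembly checks the tweet describes; `sample_brick`, `collides`, and `supported` are made-up helpers.

```python
import random

# Hypothetical toy setup: each "token" places a 1x1 brick at integer (x, y, z).

def collides(brick, placed):
    return brick in placed                       # toy check: one brick per cell

def supported(brick, placed):
    x, y, z = brick
    return z == 0 or (x, y, z - 1) in placed     # toy check: on ground or on a brick below

def sample_brick(placed):
    """Stand-in for the autoregressive model's next-token distribution."""
    return (random.randint(0, 3), random.randint(0, 3), random.randint(0, 2))

def generate_design(n_bricks=10, max_tries=100):
    placed = set()
    for _ in range(n_bricks):
        for _ in range(max_tries):
            brick = sample_brick(placed)
            # Reject samples that violate buildability, analogous to
            # enforcing constraints during inference.
            if not collides(brick, placed) and supported(brick, placed):
                placed.add(brick)
                break
    return placed

print(generate_design())
```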

Donglai Xiang (@donglaixiang) 's Twitter Profile Photo

🚨Excited to announce the 1st Workshop on Vision Meets Physics at @CVPR2025! Join us on June 12 for a full-day event exploring the synergy between physical simulation & computer vision to bridge the gap between the virtual and physical worlds. URL: tinyurl.com/vis-phys

Lili (@lchen915) 's Twitter Profile Photo

One fundamental issue with RL – whether it’s for robots or LLMs – is how hard it is to get rewards. For LLM reasoning, we need ground-truth labels to verify answers. We found that maximizing confidence alone allows LLMs to improve their reasoning with RL!
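A minimal sketch of the confidence idea, assuming the reward is defined as negative mean per-token entropy of the model's own output distribution (an illustrative choice, not necessarily the authors' exact objective):

```python
import numpy as np

def confidence_reward(token_probs):
    """Negative mean per-token entropy: higher when the model is more confident.
    token_probs: list of probability vectors over the vocabulary, one per generated token."""
    entropies = [-np.sum(p * np.log(p + 1e-12)) for p in token_probs]
    return -float(np.mean(entropies))

# Toy example: a confident answer earns a higher reward than an uncertain one,
# so an RL loop could optimize this signal without ground-truth labels.
confident = [np.array([0.97, 0.01, 0.01, 0.01])] * 5
uncertain = [np.array([0.25, 0.25, 0.25, 0.25])] * 5
print(confidence_reward(confident) > confidence_reward(uncertain))  # True
```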

Amil Dravid (@_amildravid) 's Twitter Profile Photo

Artifacts in your attention maps? Forgot to train with registers? Use test-time registers! We find a sparse set of activations that set artifact positions. We can shift them anywhere ("Shifted"), even outside the image into an untrained token. Clean maps, no retrain.
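A minimal sketch of the described trick, with all names and thresholds hypothetical: append an extra "register" token at test time and move the high-norm artifact activations onto it, so the remaining token maps come out clean.

```python
import numpy as np

def add_test_time_register(tokens, norm_threshold=10.0):
    """tokens: (n, d) patch activations from a frozen ViT layer.
    Appends one register token and shifts outlier (high-norm) activations onto it."""
    n, d = tokens.shape
    out = tokens.copy()
    register = np.zeros((1, d))
    norms = np.linalg.norm(out, axis=1)
    outliers = norms > norm_threshold              # sparse set of artifact positions
    # Move the outlier activations into the untrained register token,
    # then fill the vacated positions with a typical (mean) activation.
    register[0] = out[outliers].sum(axis=0)
    out[outliers] = out[~outliers].mean(axis=0) if (~outliers).any() else 0.0
    return np.concatenate([out, register], axis=0)  # (n + 1, d)

# Toy usage: two of eight patch tokens carry high-norm artifacts.
x = np.random.randn(8, 4)
x[[2, 5]] *= 50
print(add_test_time_register(x).shape)  # (9, 4)
```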

Keenan Crane (@keenanisalive) 's Twitter Profile Photo

For folks in the ACM SIGGRAPH community: You may or may not be aware of the controversy around the next #SIGGRAPHAsia location, summarized here: cs.toronto.edu/~jacobson/webl… If you're concerned, consider signing this letter: docs.google.com/document/d/1ZS… via this form docs.google.com/forms/d/e/1FAI…

Skild AI (@skildai) 's Twitter Profile Photo

We’ve all seen humanoid robots doing backflips and dance routines for years. But if you ask them to climb a few stairs in the real world, they stumble! We took our robot on a walk around town to environments that it hadn’t seen before. Here’s how it works 🧵⬇️

Nupur Kumari (@nupurkmr9) 's Twitter Profile Photo

🚨Reminder: Submissions for short papers to the Personalization in Generative AI Workshop at #ICCV2025 are due today!!! OpenReview: openreview.net/group?id=thecv…

Sirui Chen (@eric_srchen) 's Twitter Profile Photo

Introducing HEAD 🤖, an autonomous navigation and reaching system for humanoid robots, which allows the robot to navigate around obstacles and touch an object in the environment. More details on our website and CoRL paper: stanford-tml.github.io/HEAD

Yufei Ye (@yufei_ye) 's Twitter Profile Photo

Delivering the robot close enough to a target is an important yet often overlooked prerequisite for any meaningful robot interaction. It requires robust locomotion, navigation, and reaching all at once. HeAD is an automatic vision-based system that handles all of them.

Gaurav Parmar (@gauravtparmar) 's Twitter Profile Photo

When exploring ideas with generative models, you want a range of possibilities. Instead, you often disappointingly get a gallery of near-duplicates. The culprit is standard I.I.D. sampling. We introduce a new inference method to generate high-quality and varied outputs. 1/n
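The tweet does not spell out the new inference method; as a generic stand-in, the sketch below shows one way to favor variety over near-duplicates by greedily picking candidates that are far apart in an embedding space (farthest-point selection, not the paper's algorithm).

```python
import numpy as np

def diverse_subset(embeddings, k):
    """Greedy farthest-point selection: pick k candidates that are far apart
    in embedding space, instead of keeping the first k I.I.D. samples."""
    chosen = [0]                                     # start from an arbitrary candidate
    dists = np.linalg.norm(embeddings - embeddings[0], axis=1)
    for _ in range(k - 1):
        nxt = int(np.argmax(dists))                  # farthest from everything chosen so far
        chosen.append(nxt)
        dists = np.minimum(dists, np.linalg.norm(embeddings - embeddings[nxt], axis=1))
    return chosen

# Toy usage: 100 candidate generations embedded in 16-D; keep 4 varied ones.
cands = np.random.randn(100, 16)
print(diverse_subset(cands, 4))
```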
