Fan-Yun Sun (@sunfanyun) Twitter Tweets • TwiCopy

Fan-Yun Sun

@sunfanyun


cs phd candidate @StanfordAILab @stanfordsvl
@NVIDIAAI embodied AI, 3D, code generation

ID: 1050340210863525888

Website: https://ai.stanford.edu/~sunfanyun/

Joined: 11-10-2018 10:59:55

131 Tweets

1.1K Followers

757 Following

Yue Yang

@yueyangai

9 months ago

We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models. Website: yueyang1996.github.io/cosyn/ Dataset: huggingface.co/datasets/allen… Paper:

197 likes · 6 replies · 45 reposts

Fan-Yun Sun

@sunfanyun

9 months ago

Claude 3.7 one-shotted this 3D room: everything from layout to geometry and texture. This is always a fun way to test a model’s spatial reasoning ability.

121 likes · 7 replies · 11 reposts

siddharth ahuja

@sidahuj

8 months ago

🧩 Built an MCP that lets Claude talk directly to Blender. It helps you create beautiful 3D scenes using just prompts! Here’s a demo of me creating a “low-poly dragon guarding treasure” scene in just a few sentences👇

10,10K likes · 366 replies · 1,1K reposts

siddharth ahuja

@sidahuj

7 months ago

🧑‍🎨 The future of creative tools will look very different. 🧠 Imagine an AI control-centre for orchestrating complex tasks using just prompts. 📽️ Demo: Prompting an evil dragon with soundtrack using a single control centre (Claude). It uses both Blender MCP and Ableton MCP.

319 likes · 13 replies · 39 reposts

Silas Alberti

@silasalberti

6 months ago

we trained Kevin-32B = K(ernel D)evin using GRPO on KernelBench. it's to our knowledge the first open model trained using RL on writing CUDA kernels. it beats o3 & o4-mini in correctness & performance! shoutout to Carlo Pietro Marsella Ben Pan!! x.com/cognition_labs…

230 likes · 17 replies · 27 reposts

Wenlong Huang

@wenlong_huang

6 months ago

How to scale visual affordance learning that is fine-grained, task-conditioned, works in-the-wild, in dynamic envs? Introducing Unsupervised Affordance Distillation (UAD): distills affordances from off-the-shelf foundation models, *all without manual labels*. Very excited this

433 likes · 8 replies · 102 reposts

Fan-Yun Sun

@sunfanyun

5 months ago

Check out this work led by Tianyu Hua! We believe code generation is the future, but we need to start with better benchmarks.

4 likes · 0 replies · 0 reposts

Sanjana Srivastava

@sanjana__z

5 months ago

🤖 Household robots are becoming physically viable. But interacting with people in the home requires handling unseen, unconstrained, dynamic preferences, not just a complex physical domain. We introduce ROSETTA: a method to generate reward for such preferences cheaply. 🧵⬇️

128 likes · 4 replies · 27 reposts