Fan-Yun Sun (@sunfanyun)'s Twitter Profile
Fan-Yun Sun

@sunfanyun

cs phd candidate @StanfordAILab @stanfordsvl
@NVIDIAAI embodied AI, 3D, code generation

ID: 1050340210863525888

Link: https://ai.stanford.edu/~sunfanyun/
Joined: 11-10-2018 10:59:55

131 Tweets

1.1K Followers

757 Following

Yue Yang (@yueyangai)

We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models. Website: yueyang1996.github.io/cosyn/ Dataset: huggingface.co/datasets/allen… Paper:

Fan-Yun Sun (@sunfanyun)

Claude 3.7 one-shotted this 3D room: everything from layout to geometry and texture. This is always a fun way to test a model’s spatial reasoning ability.

siddharth ahuja (@sidahuj)

🧩 Built an MCP that lets Claude talk directly to Blender. It helps you create beautiful 3D scenes using just prompts! Here’s a demo of me creating a “low-poly dragon guarding treasure” scene in just a few sentences👇

siddharth ahuja (@sidahuj)

🧑‍🎨 The future of creative tools will look very different. 🧠 Imagine an AI control-centre for orchestrating complex tasks using just prompts. 📽️ Demo: Prompting an evil dragon with soundtrack using a single control centre (Claude). It uses both Blender MCP and Ableton MCP.

Silas Alberti (@silasalberti)

we trained Kevin-32B = K(ernel D)evin using GRPO on KernelBench. it's to our knowledge the first open model trained using RL on writing CUDA kernels. it beats o3 & o4-mini in correctness & performance! shoutout to Carlo, Pietro Marsella, and Ben Pan!! x.com/cognition_labs…

Wenlong Huang (@wenlong_huang)

How to scale visual affordance learning that is fine-grained, task-conditioned, works in-the-wild, in dynamic envs? Introducing Unsupervised Affordance Distillation (UAD): distills affordances from off-the-shelf foundation models, *all without manual labels*. Very excited this

Sanjana Srivastava (@sanjana__z)

🤖 Household robots are becoming physically viable. But interacting with people in the home requires handling unseen, unconstrained, dynamic preferences, not just a complex physical domain. We introduce ROSETTA: a method to generate reward for such preferences cheaply. 🧵⬇️