Kaiwen Wang (@kaiwenw_ai) 's Twitter Profile
Kaiwen Wang

@kaiwenw_ai

RL Research @Cornell_Tech. @Google PhD Fellow.

ID: 1233566427505778688

linkhttps://kaiwenw.github.io/ calendar_today29-02-2020 01:35:37

56 Tweet

304 Followers

479 Following

Jason Wei (@_jasonwei) 's Twitter Profile Photo

2022: I never wrote a RL paper or worked with a RL researcher. I didn’t think RL was crucial for AGI Now: I think about RL every day. My code is optimized for RL. The data I create is designed just for RL. I even view life through the lens of RL Crazy how quickly life changes

Kaiwen Wang (@kaiwenw_ai) 's Twitter Profile Photo

Making inferences robust to distribution shifts and hidden confounders is paramount for decision making under uncertainty. At the upcoming NeurIPS Conference, I’m excited to present our efficient and sharp algorithm for off-policy evaluation in robust markov decision processes. Many

Making inferences robust to distribution shifts and hidden confounders is paramount for decision making under uncertainty.

At the upcoming <a href="/NeurIPSConf/">NeurIPS Conference</a>, I’m excited to present our efficient and sharp algorithm for off-policy evaluation in robust markov decision processes. 

Many
Kaiwen Wang (@kaiwenw_ai) 's Twitter Profile Photo

Join us Pluralistic Alignment Workshop workshop at #NeurIPS to learn more about CLP! 🗓️ Sat, 14 Dec, 2024 🕙 10:40-11:40am PST 📍 West Meeting Room 116, 117 🔗 arxiv.org/abs/2407.15762 x.com/kaiwenw_ai/sta…

Jason Gauci (@neuralnets4life) 's Twitter Profile Photo

I've made FANG billions of $ with reinforcement learning, so this episode is a long-time coming :-). Episode 180: Reinforcement Learning, drops on Monday! patreon.com/posts/180-lear…

Jon Richens (@jonathanrichens) 's Twitter Profile Photo

Are world models necessary to achieve human-level agents, or is there a model-free short-cut? Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵

Are world models necessary to achieve human-level agents, or is there a model-free short-cut?
Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵
Wen Sun (@wensun1) 's Twitter Profile Photo

How can small LLMs match or even surpass frontier models like DeepSeek R1 and o3 Mini in math competition (AIME & HMMT) reasoning? Prior work seems to suggest that ideas like PRMs do not really work or scale well for long context reasoning. Kaiwen Wang will reveal how a novel

Jin Zhou (@jinpzhou) 's Twitter Profile Photo

This captures something fundamental we're seeing in AI right now! The shift from just scaling pre-training to scaling test-time compute is huge. Our Q# + VGS work shows how value-based methods can guide models through the vast implicit graphs of reasoning possibilities.

AI for Math Workshop @ ICML 2025 (@ai4mathworkshop) 's Twitter Profile Photo

It's happening today! 📍Location: West Ballroom C, Vancouver Convention Center ⌚️Time: 8:30 am - 6:00 pm 🎥 Livestream: icml.cc/virtual/2025/w… #ICML2025 #icml25 #icml #aiformath #ai4math #workshop

It's happening today!
📍Location: West Ballroom C, Vancouver Convention Center
⌚️Time: 8:30 am - 6:00 pm
🎥 Livestream: icml.cc/virtual/2025/w…

#ICML2025 #icml25 #icml #aiformath #ai4math #workshop
Kaiwen Wang (@kaiwenw_ai) 's Twitter Profile Photo

Correction re the time: my posters on Q# and VGS at AI for Math Workshop @ ICML 2025 is happening today from 10:50 am to 12:20 pm. Hope to see you there! x.com/kaiwenw_ai/sta…