
Kaiwen Wang
@kaiwenw_ai
RL Research @Cornell_Tech. @Google PhD Fellow.
ID: 1233566427505778688
https://kaiwenw.github.io/ 29-02-2020 01:35:37
56 Tweet
304 Followers
479 Following


Making inferences robust to distribution shifts and hidden confounders is paramount for decision making under uncertainty. At the upcoming NeurIPS Conference, I’m excited to present our efficient and sharp algorithm for off-policy evaluation in robust markov decision processes. Many


Join us Pluralistic Alignment Workshop workshop at #NeurIPS to learn more about CLP! 🗓️ Sat, 14 Dec, 2024 🕙 10:40-11:40am PST 📍 West Meeting Room 116, 117 🔗 arxiv.org/abs/2407.15762 x.com/kaiwenw_ai/sta…



How can small LLMs match or even surpass frontier models like DeepSeek R1 and o3 Mini in math competition (AIME & HMMT) reasoning? Prior work seems to suggest that ideas like PRMs do not really work or scale well for long context reasoning. Kaiwen Wang will reveal how a novel



Correction re the time: my posters on Q# and VGS at AI for Math Workshop @ ICML 2025 is happening today from 10:50 am to 12:20 pm. Hope to see you there! x.com/kaiwenw_ai/sta…