Vivek Myers (@vivek_myers) 's Twitter Profile
Vivek Myers

@vivek_myers

PhD student @Berkeley_AI | reinforcement learning | 🦋 @ vivekmyers.bsky.social

ID: 1201320030547431424

calendar_today02-12-2019 01:59:55

131 Tweet

896 Followers

396 Following

Chongyi Zheng (@chongyiz1) 's Twitter Profile Photo

1/ How should RL agents prepare to solve new tasks? While prior methods often learn a model that predicts the immediate next observation, we build a model that predicts many steps into the future, conditioning on different user intentions: chongyi-zheng.github.io/infom.

Seohong Park (@seohong_park) 's Twitter Profile Photo

New paper on unsupervised pre-training for RL! The idea is to learn a flow-based future prediction model for each "intention" in the dataset. We can then use these models to estimate values for fine-tuning.

Seohong Park (@seohong_park) 's Twitter Profile Photo

Q-learning is not yet scalable seohong.me/blog/q-learnin… I wrote a blog post about my thoughts on scalable RL algorithms. To be clear, I'm still highly optimistic about off-policy RL and Q-learning! I just think we haven't found the right solution yet (the post discusses why).

Q-learning is not yet scalable

seohong.me/blog/q-learnin…

I wrote a blog post about my thoughts on scalable RL algorithms.

To be clear, I'm still highly optimistic about off-policy RL and Q-learning! I just think we haven't found the right solution yet (the post discusses why).
Siddharth Karamcheti (@siddkaramcheti) 's Twitter Profile Photo

Thrilled to share that I'll be starting as an Assistant Professor at Georgia Tech (Georgia Tech School of Interactive Computing / Robotics@GT / Machine Learning at Georgia Tech) in Fall 2026. My lab will tackle problems in robot learning, multimodal ML, and interaction. I'm recruiting PhD students this next cycle – please apply/reach out!

Thrilled to share that I'll be starting as an Assistant Professor at Georgia Tech (<a href="/ICatGT/">Georgia Tech School of Interactive Computing</a> / <a href="/GTrobotics/">Robotics@GT</a> / <a href="/mlatgt/">Machine Learning at Georgia Tech</a>) in Fall 2026.

My lab will tackle problems in robot learning, multimodal ML, and interaction. I'm recruiting PhD students this next cycle – please apply/reach out!
Andrew Wagenmaker (@ajwagenmaker) 's Twitter Profile Photo

Diffusion policies have demonstrated impressive performance in robot control, yet are difficult to improve online when 0-shot performance isn’t enough. To address this challenge, we introduce DSRL: Diffusion Steering via Reinforcement Learning. (1/n) diffusion-steering.github.io

Qiyang Li (@qiyang_li) 's Twitter Profile Photo

Everyone knows action chunking is great for imitation learning. It turns out that we can extend its success to RL to better leverage prior data for improved exploration and online sample efficiency! colinqiyangli.github.io/qc/ The recipe to achieve this is incredibly simple. 🧵 1/N

Andrew Wagenmaker (@ajwagenmaker) 's Twitter Profile Photo

How can we train a foundation model to internalize what it means to “explore”? Come check out our work on “behavioral exploration” at ICML25 to find out!

How can we train a foundation model to internalize what it means to “explore”?

Come check out our work on “behavioral exploration” at ICML25 to find out!
Eric Frankel (@esfrankel) 's Twitter Profile Photo

Tomorrow, I'm excited to present "Finite-Time Convergence Rates in Stochastic Stackelberg Games with Smooth Algorithmic Agents", which addresses how a principal can influence the behavior of competitive learning agents! #ICML2025 📍West Exhibition Hall, W-817, 11:00 - 1:30 🧵👇

Tomorrow, I'm excited to present "Finite-Time Convergence Rates in Stochastic Stackelberg Games with Smooth Algorithmic Agents", which addresses how a principal can influence the behavior of competitive learning agents! #ICML2025

📍West Exhibition Hall, W-817, 11:00 - 1:30

🧵👇
Data On the Brain & Mind Workshop @NeurIPS2025 (@dataonbrainmind) 's Twitter Profile Photo

🚨 Excited to announce our #NeurIPS2025 Workshop: Data on the Brain & Mind 📣 Call for: Findings (4- or 8-page) + Tutorials tracks 🎙️ Speakers include FieteGroup Daniel Yamins Cengiz Pehlevan Rajesh P. N. Rao Laura Gwilliams 🌐 Learn more: data-brain-mind.github.io

🚨 Excited to announce our #NeurIPS2025 Workshop: Data on the Brain &amp; Mind

📣 Call for: Findings (4- or 8-page) + Tutorials tracks

🎙️ Speakers include <a href="/FieteGroup/">FieteGroup</a> <a href="/dyamins/">Daniel Yamins</a> <a href="/CPehlevan/">Cengiz Pehlevan</a> <a href="/RajeshPNRao/">Rajesh P. N. Rao</a> <a href="/GwilliamsL/">Laura Gwilliams</a> 

🌐 Learn more: data-brain-mind.github.io
Joey Hejna (@joeyhejna) 's Twitter Profile Photo

We're hosting the 1st workshop on Making Sense of Data in Robotics at Conference on Robot Learning this year! We'll investigate what makes robot learning data "good" by discussing: 🧩 Data Composition 🧹 Data Curation 💡 Data Interpretability Paper submissions are due 8/22/2025! 🧵(1/3)

Catherine Glossop (@catglossop) 's Twitter Profile Photo

Inherent biases and imbalances in robot data can make training steerable VLA policies challenging. We introduce CAST, a method to augment datasets with counterfactuals to induce better language following cast-vla.github.io ← paper, code, data, and more available here! 🧵

Data On the Brain & Mind Workshop @NeurIPS2025 (@dataonbrainmind) 's Twitter Profile Photo

📢10 days left to submit to the Data on the Brain & Mind Workshop at #NeurIPS2025 📝Call for: • Findings • Tutorials Perfect if you’re prepping for ICLR or already in NeurIPS, show how to use a cog neuro dataset by submitting to our tutorial track!🔗data-brain-mind.github.io

Alicja Ziarko (@ziarkoalicja) 's Twitter Profile Photo

Can complex reasoning emerge directly from learned representations? In our new work, we study representations that capture both perceptual and temporal structure, enabling agents to reason without explicit planning. princeton-rl.github.io/CRTR/

Data On the Brain & Mind Workshop @NeurIPS2025 (@dataonbrainmind) 's Twitter Profile Photo

🧠 Working with neuro datasets? You can submit a notebook that shows how to work with the dataset—explaining how to process it, and potential applications—as a tutorial to the Data on the Brain and Mind workshop. 📅 Extended deadline: Sept 8, 2025 🔗 data-brain-mind.github.io

Data On the Brain & Mind Workshop @NeurIPS2025 (@dataonbrainmind) 's Twitter Profile Photo

🚨 Tutorial Track deadline extended! Now Sept 12 (AoE) Working with neuro datasets? Submit a notebook that: • Shows how to process the data • Explains potential applications OpenReview: openreview.net/group?id=NeurI… More Information: data-brain-mind.github.io

Chongyi Zheng (@chongyiz1) 's Twitter Profile Photo

1/ How can we model the future rewards (returns) for RL agents? While prior methods round the returns into discrete bins or predict a finite number of quantiles, we use flexible models to predict the fine-grained structure of the full return distribution: pd-perry.github.io/value-flows.