
Stone Tao
@stone_tao
PhDing @UCSanDiego @HaoSuLabUCSD @hillbot_ai on scalable robot learning and embodied AI. Co-founded @LuxAIChallenge to build AI competitions. @NSF GRFP fellow
ID: 1136188068
http://stoneztao.com 31-01-2013 06:27:56
2,2K Tweet
4,4K Followers
1,1K Following










🔥🚨 Preprint alert: Relative Entropy Pathwise Policy Optimization #REPPO 🚨🔥 What if you could have on-policy training without the instability and parameter tuning that plagues #PPO? What if training with deterministic policy gradient just worked? With our new method it does!




