Akash Sharma
@akashshrm02
PhD candidate @CMU_Robotics | Visiting researcher @AIatMeta
Interested in multimodal robot perception (vision, touch, audio)
ID: 2546591193
http://akashsharma02.github.io 14-05-2014 04:44:15
208 Tweet
506 Followers
735 Following
The secret behind this demo by Sudharshan Suresh #RSS2025 Keypoints, object poses, defined grasps and MPC
Checkout DemoDiffusion from Homanga Bharadhwaj! The key idea is quite simple: denoise from human trajectories instead of random noise! I am hopeful to see this scaled up to more complex embodiments!
TRI's latest Large Behavior Model (LBM) paper landed on arxiv last night! Check out our project website: toyotaresearchinstitute.github.io/lbm1/ One of our main goals for this paper was to put out a very careful and thorough study on the topic to help people understand the state of the
Reconstruct, Inpaint, Finetune: Dynamic Novel-view Synthesis from Monocular Videos Kaihua Chen, Tarasha Khurana, Deva Ramanan tl;dr: in title; fine-tune CogVideoX->train 2D video-inpainter arxiv.org/abs/2507.12646
CogNVS was accepted to NeurIPS Conference 2025! 🎉We are releasing the code today for you all to try: 🆕Code: github.com/Kaihua-Chen/co… Paper: arxiv.org/pdf/2507.12646 With CogNVS, we reformulate dynamic novel-view synthesis as a structured inpainting task: (1) we reconstruct input
A reminder that accurate motion estimation sparse visual SLAM has been in the domain of industry for many years now, and what you might often see in academic papers as the "state of the art" is fairly meaningless. (From Paul-Edouard Sarlin @pesarlin.bsky.social.bsky.social)
One of the reasons for spending sabbatical time at CMU, was to meet new people. Just finished a GREAT walk and talk with Tarasha Khurana It was a pleasure meeting you Tarasha, keep in touch! Check out her work: cs.cmu.edu/~tkhurana/