Nithish Kannen (@nithishkannen)'s Twitter Profile
Nithish Kannen

@nithishkannen

Multimodal models @GoogleDeepmind

ID: 1359122237421285378

https://nitkannen.github.io/

09-02-2021 12:50:39

216 Tweets

875 Followers

2.2K Following

Brian Bartoldson (@bartoldson)

🚀 We fixed a major LLM post-training bottleneck! Our new method (TBA) combines trajectory balance with asynchronous training to speed up LLM RL 5-50x while improving results+scalability. For example, using VinePPO's GSM8K setup, we obtain +1.2% accuracy and 50x faster RL.

Partha Talukdar (@partha_p_t)

Google DeepMind India is hiring for a research scientist role in multicultural & multimodal modeling. Strong candidates with proven research experience are encouraged to apply. I shall be at #icassp2025 Hyderabad on Apr 8, happy to meet and chat, pls DM. job-boards.greenhouse.io/deepmind/jobs/…

Google DeepMind (@googledeepmind)

You write the script, Veo 2 brings it to life. 🎥 Starting today, Google Gemini App Advanced users can create stunning 8-second videos, in 720p cinematic quality, with just one text prompt. ✨

Nithish Kannen (@nithishkannen)

Touching down in Singapore attending #ICLR2025 this week. If you're into Multimodal modelling, visual reasoning, RL for diffusion, multiculturality or anything remotely close - let's link up!! Always up for good discussions, food recs and better coffee 🌊🏖️🍛☕️

Nithish Kannen (@nithishkannen)

Super intriguing panel discussion at the SSI workshop. Yoshua Bengio: The naming of the workshop - is genuinely worrisome. Appreciate the spice and insights, Noah! #ICLR2025

Google DeepMind (@googledeepmind)

Watch Gemini 2.5 Pro implement a landmark Google DeepMind research paper. 🕹️ It codes the reinforcement learning algorithm, visualizes the training live and even debugs errors. ↓

Nithish Kannen (@nithishkannen)

Proactive T2I Agents is accepted to #ICML2025 🚀 We show it is possible to significantly improve multimodal generation (at least 2x gains) with multi-turn agents that act based on uncertainty principles.

Zi Wang, Ph.D. (@ziwphd)

Our work on proactive text-to-image agents – enabling them to ask for clarification and share their understanding – has been accepted at #ICML2025! So proud of the team's efforts. See you in Vancouver! 🙌

Oriol Vinyals (@oriolvinyalsml)

Ahead of I/O, we're releasing an updated Gemini 2.5 Pro! It's now #1 on WebDevArena leaderboard, breaking the 1400 ELO barrier! 🥇 Our most advanced coding model yet, with stronger performance on code transformation & editing. Excited to build drastic agents on top of this!

Aniket Rege (@wregss)

Abhipsa Basu, Venkatesh Babu, Danish Pruthi, Simran Khanuja, Nithish Kannen 🚨 I'll be giving a short contributed talk about CuRe at CVPR's DemoDiv workshop today (06/11) at 10 AM! Please drop by, I'd love to chat. x.com/polkirichenko/…

Harman Singh (@harman26singh)

🚨 New Google DeepMind paper: Robust Reward Modeling via Causal Rubrics 📑 👉 arxiv.org/abs/2506.16507 We tackle reward hacking: when RMs latch onto spurious cues (e.g. length, style) instead of true quality. #RLAIF #CausalInference 🧵⬇️
