Nithish Kannen (@nithishkannen) Twitter Tweets • TwiCopy

Brian Bartoldson

8 months ago

🚀 We fixed a major LLM post-training bottleneck! Our new method (TBA) combines trajectory balance with asynchronous training to speed up LLM RL 5-50x while improving results+scalability. For example, using VinePPO's GSM8K setup, we obtain +1.2% accuracy and 50x faster RL.

Likes: 256 · Replies: 3 · Reposts: 49

Nithish Kannen

@nithishkannen

8 months ago

🚀 really excited by the multimodal reasoning capabilities of this model!

Likes: 6 · Replies: 0 · Reposts: 0

Nithish Kannen

@nithishkannen

8 months ago

This is really cool!

Likes: 2 · Replies: 0 · Reposts: 0

Partha Talukdar

@partha_p_t

8 months ago

Google DeepMind India is hiring for a research scientist role in multicultural & multimodal modeling. Strong candidates with proven research experience are encouraged to apply. I shall be at #icassp2025 Hyderabad on Apr 8, happy to meet and chat, pls DM. job-boards.greenhouse.io/deepmind/jobs/…

Likes: 441 · Replies: 4 · Reposts: 52

Nithish Kannen

@nithishkannen

8 months ago

Great opportunity for MS/PhD students working on speech/language. Sumanth and team work on cool stuff and are really fun!!

Likes: 0 · Replies: 0 · Reposts: 0

Google DeepMind

@googledeepmind

7 months ago

You write the script, Veo 2 brings it to life. 🎥 Starting today, Google Gemini App Advanced users can create stunning 8-second videos, in 720p cinematic quality, with just one text prompt. ✨

Likes: 1.1K · Replies: 74 · Reposts: 186

Nithish Kannen

@nithishkannen

7 months ago

Touching down in Singapore attending #ICLR2025 this week. If you're into Multimodal modelling, visual reasoning, RL for diffusion, multiculturality or anything remotely close - let's link up!! Always up for good discussions, food recs and better coffee 🌊🏖️🍛☕️

Likes: 18 · Replies: 0 · Reposts: 0

Nithish Kannen

@nithishkannen

7 months ago

Super intriguing panel discussion at the SSI workshop Yoshua Bengio: The naming of the workshop — is genuinely worrisome. Appreciate the spice and insights, Noah! #ICLR2025


Likes: 34 · Replies: 0 · Reposts: 4

Google DeepMind

@googledeepmind

7 months ago

Watch Gemini 2.5 Pro implement a landmark Google DeepMind research paper. 🕹️ It codes the reinforcement learning algorithm, visualizes the training live and even debugs errors. ↓

Likes: 2.2K · Replies: 57 · Reposts: 399

Nithish Kannen

@nithishkannen

7 months ago

Proactive T2I Agents is accepted to #ICML2025 🚀 We show it is possible to significantly improve multimodal generation (at least 2x gains) with multi-turn agents that act based on uncertainty principles.

Likes: 22 · Replies: 0 · Reposts: 2

Zi Wang, Ph.D.

@ziwphd

7 months ago

Our work on proactive text-to-image agents – enabling them to ask for clarification and share their understanding – has been accepted at #ICML2025! So proud of the team's efforts. See you in Vancouver! 🙌

Likes: 19 · Replies: 1 · Reposts: 5

Oriol Vinyals

@oriolvinyalsml

7 months ago

Ahead of I/O, we’re releasing an updated Gemini 2.5 Pro! It’s now #1 on WebDevArena leaderboard, breaking the 1400 ELO barrier! 🥇 Our most advanced coding model yet, with stronger performance on code transformation & editing. Excited to build drastic agents on top of this!

Likes: 761 · Replies: 35 · Reposts: 64

testtm

@test_tm7873

7 months ago

Gemini-2.5-Pro-preview-05-06 isn't only the best model for coding. It's the best model for everything! Across all tasks!

Likes: 1.1K · Replies: 37 · Reposts: 121

Medhini Narasimhan

@medhini_n

6 months ago

Physics physics physics! #veo3

Likes: 948 · Replies: 21 · Reposts: 76

Shlomi Fruchter

@shlomifruchter

6 months ago

Oh oh, they found out it can rap :)

Likes: 13 · Replies: 1 · Reposts: 1

Similarweb

@similarweb

6 months ago

The Veo3 effect on traffic to Google DeepMind. Have you tried it yet?


Likes: 441 · Replies: 15 · Reposts: 47

Aniket Rege

@wregss

6 months ago

Abhipsa Basu, Venkatesh Babu, Danish Pruthi, Simran Khanuja, Nithish Kannen 🚨 I'll be giving a short contributed talk about CuRe at CVPR's DemoDiv workshop today (06/11) at 10 AM! Please drop by, I'd love to chat 😁 x.com/polkirichenko/…

Likes: 3 · Replies: 0 · Reposts: 2

Harman Singh

@harman26singh

5 months ago

🚨 New Google DeepMind paper 𝐑𝐨𝐛𝐮𝐬𝐭 𝐑𝐞𝐰𝐚𝐫𝐝 𝐌𝐨𝐝𝐞𝐥𝐢𝐧𝐠 𝐯𝐢𝐚 𝐂𝐚𝐮𝐬𝐚𝐥 𝐑𝐮𝐛𝐫𝐢𝐜𝐬 📑 👉 arxiv.org/abs/2506.16507 We tackle reward hacking—when RMs latch onto spurious cues (e.g. length, style) instead of true quality. #RLAIF #CausalInference 🧵⬇️


Likes: 114 · Replies: 4 · Reposts: 23