Jonathan Lai (@_jlai) 's Twitter Profile
Jonathan Lai

@_jlai

Post training @GoogleDeepMind, Gemini Reasoning, RL, Opinions are my own

ID: 971441900

Joined: 26-11-2012 06:31:19

18 Tweets

115 Followers

138 Following

Jonathan Lai (@_jlai) 's Twitter Profile Photo

🚨✨ Thrilled to share the first study on model merging at large scales by our intern Prateek Yadav Google AI Google DeepMind For larger LLMs, merging is an efficient alternative to multitask learning that can preserve the majority of in-domain performance, while significantly

Prateek Yadav (@prateeky2806) 's Twitter Profile Photo

I'm on the job market! Please reach out if you are looking to hire someone to work on - RLHF - Efficiency - MoE/Modular models - Synthetic Data - Test time compute - other phases of pre/post-training. If you are not hiring then I would appreciate a retweet! More details👇

Logan Kilpatrick (@officiallogank) 's Twitter Profile Photo

We are rolling out a new Gemini 2.0 Flash Thinking update: - Exp-01-21 variant in AI Studio and API for free - 1 million token context window - Native code execution support - Longer output token generation - Less frequent model contradictions Try it aistudio.google.com

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

1/ Gemini 2.5 is here, and it’s our most intelligent AI model ever. Our first 2.5 model, Gemini 2.5 Pro Experimental is a state-of-the-art thinking model, leading in a wide range of benchmarks – with impressive improvements in enhanced reasoning and coding and now #1 on

Tsendsuren (@tsendeemts) 's Twitter Profile Photo

Try out the new Gemini model. It has improved considerably at writing code.

Tu Vu (@tuvllms) 's Twitter Profile Photo

🚨 New paper 🚨 Excited to share my first paper w/ my PhD students!! We find that advanced LLM capabilities conferred by instruction or alignment tuning (e.g., SFT, RLHF, DPO, GRPO) can be encoded into model diff vectors (à la task vectors) and transferred across model

Tu Vu (@tuvllms) 's Twitter Profile Photo

Excited to share that our paper on model merging at scale has been accepted to Transactions on Machine Learning Research (TMLR). Huge congrats to my intern Prateek Yadav and our awesome co-authors Jonathan Lai, Alexandra Chronopoulou, Manaal Faruqui, Mohit Bansal, and Tsendsuren 🎉!!
