Sachin Goyal (@goyalsachin007) 's Twitter Profile
Sachin Goyal

@goyalsachin007

PhD student @ CMU MLD || Microsoft Research || UG @ IIT Bombay

ID: 4562995039

linkhttp://saching007.github.io calendar_today15-12-2015 05:11:53

445 Tweet

1,1K Followers

693 Following

Sachin Goyal (@goyalsachin007) 's Twitter Profile Photo

We show some pretty intriguing results with massive implications! Tune into Aditi’s talk at SCSL workshop, 1:30pm on Monday, Garnet 214-215 (Floor 2). #ICLR2025

Brandon Trabucco @ ICLR (@brandontrabucco) 's Twitter Profile Photo

Building LLM Agents? Come to my talk at the #ICLR DATA-FM workshop today at 2:30pm, Hall 4, Section 4. I'll be presenting InSTA, our work building the largest environment for agents on the live internet. arxiv.org/abs/2502.06776 #Agents #LLM

Building LLM Agents? Come to my talk at the #ICLR DATA-FM workshop today at 2:30pm, Hall 4, Section 4.

I'll be presenting InSTA, our work building the largest environment for agents on the live internet.

arxiv.org/abs/2502.06776

#Agents #LLM
Yutong (Kelly) He (@electronickale) 's Twitter Profile Photo

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵

Jacob Springer (@jacspringer) 's Twitter Profile Photo

Our paper on how overtraining LLMs can make fine-tuning harder won awards at two different #ICLR2025 workshops! I'm honored and thrilled! Outstanding paper @ SCOPE Entropic Paper Award @ ICBINB

Our paper on how overtraining LLMs can make fine-tuning harder won awards at two different #ICLR2025 workshops! I'm honored and thrilled!
Outstanding paper @ SCOPE
Entropic Paper Award @ ICBINB
Divyat Mahajan (@divyat09) 's Twitter Profile Photo

Happy to share that Compositional Risk Minimization has been accepted at #ICML2025 📌Extensive theoretical analysis along with a practical approach for extrapolating classifiers to novel compositions! 📜 arxiv.org/abs/2410.06303

Happy to share that Compositional Risk Minimization has been accepted at #ICML2025

📌Extensive theoretical analysis along with a practical approach for extrapolating classifiers to novel compositions!

📜 arxiv.org/abs/2410.06303
Pratyush Maini (@pratyushmaini) 's Twitter Profile Photo

Looking forward to giving a talk this Friday OpenAI with Zhili Feng on some of our privacy & memorization research + how it applies to production LLMs! We've been gaining momentum on detecting, quantifying & erasing memorization; excited to explore its real-world impact!

Looking forward to giving a talk this Friday <a href="/OpenAI/">OpenAI</a> with <a href="/zhilifeng/">Zhili Feng</a> on some of our privacy &amp; memorization research + how it applies to production LLMs! 

We've been gaining momentum on detecting, quantifying &amp; erasing memorization;  excited to explore its real-world impact!
Equalyz_AI (@equalyz_ai) 's Twitter Profile Photo

Winner of the Entropic Paper Award at ICLR 2026 #ICLR2025 Groundbreaking research by Jacob Mitchell Springer Jacob Springer (CMU), Sachin Goyal Sachin Goyal (CMU), Kaiyue Wen Kaiyue Wen (Stanford), Tanishq Kumar (Harvard), Xiang Yue Xiang Yue (CMU), Sadhika Malladi

Winner of the Entropic Paper Award at  <a href="/iclr_conf/">ICLR 2026</a>   #ICLR2025 

Groundbreaking research by Jacob Mitchell Springer 
 <a href="/jacspringer/">Jacob Springer</a> (CMU), Sachin Goyal <a href="/goyalsachin007/">Sachin Goyal</a> (CMU), Kaiyue Wen <a href="/wen_kaiyue/">Kaiyue Wen</a> (Stanford), Tanishq Kumar (Harvard), Xiang Yue <a href="/xiangyue96/">Xiang Yue</a> (CMU), Sadhika Malladi
Alex Dimakis (@alexgdimakis) 's Twitter Profile Photo

"RL with only one training example" and "Test-Time RL" are two recent papers that I found fascinating. In the "One Training example" paper the authors find one question and ask the model to solve it again and again. Every time, the model tries 8 times (the Group in GRPO), and

"RL with only one training example" and "Test-Time RL" are two recent papers that I found fascinating. 

In the "One Training example" paper 
the authors find one question and ask the model to solve it again and again. Every time, the model tries 8 times (the Group in GRPO), and
Zhengyang Geng (@zhengyanggeng) 's Twitter Profile Photo

Excited to share our work with my amazing collaborators, Goodeat, Xingjian Bai, Zico Kolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,

Excited to share our work with my amazing collaborators, <a href="/Goodeat258/">Goodeat</a>, <a href="/SimulatedAnneal/">Xingjian Bai</a>, <a href="/zicokolter/">Zico Kolter</a>, and Kaiming.

In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,
Vaishnavh Nagarajan (@_vaishnavh) 's Twitter Profile Photo

follow Aditi Raghunathan for all the other exciting LLM stuff going on in Aditi's group! i owe a lot to the group & its students. :) they have the most liveliest slack channel I've been on, keeping me up to date with all the latest AI stuff, great ideas & thoughtful discussions

Pratyush Kumar (@pratykumar) 's Twitter Profile Photo

New model drop - Sarvam-Translate is here. Can translate between 22 Indian languages & English. Significantly better than much larger models. Improves on nuance, long-form, structured text. Available as super-fast APIs. Try it here: dashboard.sarvam.ai/translate

Aditi Raghunathan (@adtraghunathan) 's Twitter Profile Photo

Excited to speak at the CVPR workshop on domain generalization! Estimating model performance in the wild is hard but crucial. I'll present agreement-on-the-line, a simple and surprisingly powerful phenomenon. It is easily one of the most intriguing things I've studied.

Vaishnavh Nagarajan (@_vaishnavh) 's Twitter Profile Photo

Wrote my first blog post! I wanted to share a powerful yet under-recognized way to develop emotional maturity as a researcher: making it a habit to read about the ✨past ✨ and learn from it to make sense of the present

Wrote my first blog post! I wanted to share a powerful yet under-recognized way to develop emotional maturity as a researcher: 

making it a habit to read about the ✨past ✨ and learn from it to make sense of the present
Yuandong Tian (@tydsh) 's Twitter Profile Photo

📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability

Pratyush Maini (@pratyushmaini) 's Twitter Profile Photo

All set for the #ICML grind after the #GrouseGrind⛰️ Eager to discuss my recent works on quantifying, detecting, & eliminating model memorization. I'll be at one of the events below when not at the DatologyAI booth. Pls DM if you'd like to chat about data quality or privacy!

All set for the #ICML grind after the #GrouseGrind⛰️ 

Eager to discuss my recent works on quantifying, detecting, &amp; eliminating model memorization.

I'll be at one of the events below when not at the <a href="/datologyai/">DatologyAI</a> booth. Pls DM if you'd like to chat about data quality or privacy!
Divyat Mahajan (@divyat09) 's Twitter Profile Photo

Presenting CRM at #ICML2025 📌 Wednesday, 16th July, 11 am 📍East Exhibition Hall A-B (E-2101) Lets chat about distribution shifts! Been deep into causality & invariance based perspectives, and recently exploring robust LLM pretraining architectures.

Presenting CRM at #ICML2025 

📌 Wednesday,  16th July, 11 am
📍East Exhibition Hall A-B (E-2101)

Lets chat about distribution shifts! Been deep into causality &amp; invariance based perspectives, and recently exploring robust LLM pretraining architectures.
Akari Asai (@akariasai) 's Twitter Profile Photo

Some updates 🚨 I finished my Ph.D at Allen School in June 2025! After a year at AI2 as a Research Scientist, I am joining CMU Language Technologies Institute | @CarnegieMellon & Machine Learning Dept. at Carnegie Mellon (courtesy) as an Assistant Professor in Fall 2026. The journey, acknowledgments & recruiting in 🧵

Some updates 🚨
I finished my Ph.D at <a href="/uwcse/">Allen School</a> in June 2025!
After a year at AI2 as a Research Scientist, I am joining CMU <a href="/LTIatCMU/">Language Technologies Institute | @CarnegieMellon</a> &amp; <a href="/mldcmu/">Machine Learning Dept. at Carnegie Mellon</a> (courtesy) as an Assistant Professor in Fall 2026.
The journey, acknowledgments &amp; recruiting in 🧵