Sachin Goyal (@goyalsachin007) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Sachin Goyal

@goyalsachin007

3 months ago

We show some pretty intriguing results with massive implications! Tune into Aditi’s talk at SCSL workshop, 1:30pm on Monday, Garnet 214-215 (Floor 2). #ICLR2025

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

Building LLM Agents? Come to my talk at the #ICLR DATA-FM workshop today at 2:30pm, Hall 4, Section 4. I'll be presenting InSTA, our work building the largest environment for agents on the live internet. arxiv.org/abs/2502.06776 #Agents #LLM

thumb_up_off_alt44

chat_bubble_outline2

repeat12

shareShare

Yutong (Kelly) He

@electronickale

3 months ago

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵

thumb_up_off_alt83

chat_bubble_outline2

repeat31

shareShare

Jacob Springer

@jacspringer

3 months ago

Our paper on how overtraining LLMs can make fine-tuning harder won awards at two different #ICLR2025 workshops! I'm honored and thrilled! Outstanding paper @ SCOPE Entropic Paper Award @ ICBINB

thumb_up_off_alt125

chat_bubble_outline3

repeat12

shareShare

Divyat Mahajan

@divyat09

2 months ago

Happy to share that Compositional Risk Minimization has been accepted at #ICML2025 📌Extensive theoretical analysis along with a practical approach for extrapolating classifiers to novel compositions! 📜 arxiv.org/abs/2410.06303

thumb_up_off_alt159

chat_bubble_outline4

repeat31

shareShare

Pratyush Maini

@pratyushmaini

2 months ago

Looking forward to giving a talk this Friday OpenAI with Zhili Feng on some of our privacy & memorization research + how it applies to production LLMs! We've been gaining momentum on detecting, quantifying & erasing memorization; excited to explore its real-world impact!

Looking forward to giving a talk this Friday <a href="/OpenAI/">OpenAI</a> with <a href="/zhilifeng/">Zhili Feng</a> on some of our privacy & memorization research + how it applies to production LLMs!

We've been gaining momentum on detecting, quantifying & erasing memorization; excited to explore its real-world impact!

thumb_up_off_alt101

chat_bubble_outline0

repeat10

shareShare

Equalyz_AI

@equalyz_ai

2 months ago

Winner of the Entropic Paper Award at ICLR 2026 #ICLR2025 Groundbreaking research by Jacob Mitchell Springer Jacob Springer (CMU), Sachin Goyal Sachin Goyal (CMU), Kaiyue Wen Kaiyue Wen (Stanford), Tanishq Kumar (Harvard), Xiang Yue Xiang Yue (CMU), Sadhika Malladi

Winner of the Entropic Paper Award at <a href="/iclr_conf/">ICLR 2026</a> #ICLR2025

Groundbreaking research by Jacob Mitchell Springer
<a href="/jacspringer/">Jacob Springer</a> (CMU), Sachin Goyal <a href="/goyalsachin007/">Sachin Goyal</a> (CMU), Kaiyue Wen <a href="/wen_kaiyue/">Kaiyue Wen</a> (Stanford), Tanishq Kumar (Harvard), Xiang Yue <a href="/xiangyue96/">Xiang Yue</a> (CMU), Sadhika Malladi

thumb_up_off_alt30

chat_bubble_outline0

repeat6

shareShare

Alex Dimakis

@alexgdimakis

2 months ago

"RL with only one training example" and "Test-Time RL" are two recent papers that I found fascinating. In the "One Training example" paper the authors find one question and ask the model to solve it again and again. Every time, the model tries 8 times (the Group in GRPO), and

thumb_up_off_alt1,1K

chat_bubble_outline38

repeat190

shareShare

Zhengyang Geng

@zhengyanggeng

2 months ago

Excited to share our work with my amazing collaborators, Goodeat, Xingjian Bai, Zico Kolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,

Excited to share our work with my amazing collaborators, <a href="/Goodeat258/">Goodeat</a>, <a href="/SimulatedAnneal/">Xingjian Bai</a>, <a href="/zicokolter/">Zico Kolter</a>, and Kaiming.

In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,

thumb_up_off_alt111

chat_bubble_outline4

repeat28

shareShare

Sachin Goyal

@goyalsachin007

a month ago

Very interesting work!

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Vaishnavh Nagarajan

@_vaishnavh

a month ago

follow Aditi Raghunathan for all the other exciting LLM stuff going on in Aditi's group! i owe a lot to the group & its students. :) they have the most liveliest slack channel I've been on, keeping me up to date with all the latest AI stuff, great ideas & thoughtful discussions

thumb_up_off_alt23

chat_bubble_outline1

repeat2

shareShare

Sachin Goyal

@goyalsachin007

a month ago

So true!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Pratyush Kumar

@pratykumar

a month ago

New model drop - Sarvam-Translate is here. Can translate between 22 Indian languages & English. Significantly better than much larger models. Improves on nuance, long-form, structured text. Available as super-fast APIs. Try it here: dashboard.sarvam.ai/translate

thumb_up_off_alt1,1K

chat_bubble_outline36

repeat208

shareShare

Aditi Raghunathan

@adtraghunathan

a month ago

Excited to speak at the CVPR workshop on domain generalization! Estimating model performance in the wild is hard but crucial. I'll present agreement-on-the-line, a simple and surprisingly powerful phenomenon. It is easily one of the most intriguing things I've studied.

thumb_up_off_alt25

chat_bubble_outline2

repeat4

shareShare

Vaishnavh Nagarajan

@_vaishnavh

a month ago

Wrote my first blog post! I wanted to share a powerful yet under-recognized way to develop emotional maturity as a researcher: making it a habit to read about the ✨past ✨ and learn from it to make sense of the present

thumb_up_off_alt96

chat_bubble_outline1

repeat13

shareShare

Yuandong Tian

@tydsh

a month ago

📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability

thumb_up_off_alt1,1K

chat_bubble_outline25

repeat159

shareShare

Sachin Goyal

@goyalsachin007

9 days ago

Pressing reload on 10's of VS Code windows in the morning is the adult version of reload in Counter-Strike 🔃.

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Pratyush Maini

@pratyushmaini

2 days ago

All set for the #ICML grind after the #GrouseGrind⛰️ Eager to discuss my recent works on quantifying, detecting, & eliminating model memorization. I'll be at one of the events below when not at the DatologyAI booth. Pls DM if you'd like to chat about data quality or privacy!

thumb_up_off_alt76

chat_bubble_outline2

repeat4

shareShare

Divyat Mahajan

@divyat09

2 days ago

Presenting CRM at #ICML2025 📌 Wednesday, 16th July, 11 am 📍East Exhibition Hall A-B (E-2101) Lets chat about distribution shifts! Been deep into causality & invariance based perspectives, and recently exploring robust LLM pretraining architectures.

thumb_up_off_alt42

chat_bubble_outline0

repeat8

shareShare

Akari Asai

@akariasai

18 hours ago

Some updates 🚨 I finished my Ph.D at Allen School in June 2025! After a year at AI2 as a Research Scientist, I am joining CMU Language Technologies Institute | @CarnegieMellon & Machine Learning Dept. at Carnegie Mellon (courtesy) as an Assistant Professor in Fall 2026. The journey, acknowledgments & recruiting in 🧵

Some updates 🚨
I finished my Ph.D at <a href="/uwcse/">Allen School</a> in June 2025!
After a year at AI2 as a Research Scientist, I am joining CMU <a href="/LTIatCMU/">Language Technologies Institute | @CarnegieMellon</a> & <a href="/mldcmu/">Machine Learning Dept. at Carnegie Mellon</a> (courtesy) as an Assistant Professor in Fall 2026.
The journey, acknowledgments & recruiting in 🧵

thumb_up_off_alt1,1K

chat_bubble_outline85

repeat52

shareShare

Sachin Goyal

Gate.io

Sachin Goyal

Brandon Trabucco @ ICLR

Yutong (Kelly) He

Jacob Springer

Divyat Mahajan

Pratyush Maini

Equalyz_AI

Alex Dimakis

Zhengyang Geng

Sachin Goyal

Vaishnavh Nagarajan

Sachin Goyal

Pratyush Kumar

Aditi Raghunathan

Vaishnavh Nagarajan

Yuandong Tian

Sachin Goyal

Pratyush Maini

Divyat Mahajan

Akari Asai