Koustuv Sinha (@koustuvsinha)'s Twitter Profile
Koustuv Sinha

@koustuvsinha

Research Scientist @MetaAI; PhD from @mcgillu + @Mila_Quebec; I organize ML Reproducibility Challenge (@repro_challenge). I work on Interpretable multimodal ML

ID: 42234513

Link: https://koustuvsinha.com · Joined: 24-05-2009 16:12:07

946 Tweets

2.2K Followers

761 Following

The ML Reproducibility Challenge (@repro_challenge)'s Twitter Profile Photo

Less than a week left to #MLRC2025! Here is the agenda for the day. Location - Friend Center @ Princeton University maps.app.goo.gl/Cq39Ju3qNYCMy3…
Christopher Potts (@chrisgpotts)'s Twitter Profile Photo

The Transformer is not quite a Ship of Theseus, but, since 2017, best practices have radically changed for positional encodings, layer norms, attention, residual streams, and MLP components. Architecture work is not dead, and scaling is not all you need to respect scaling laws.

Oscar Mañas @ ICLR (@oscmansan)'s Twitter Profile Photo

I’m happy to share that our paper "Controlling Multimodal LLMs via Reward-guided Decoding" has been accepted to #ICCV2025! 🎉

w/ Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, and Aishwarya Agrawal

🔗 Read more: arxiv.org/abs/2508.11616

🧵 Here's what we did:
Bryan Catanzaro (@ctnzr)'s Twitter Profile Photo

Today we're releasing NVIDIA Nemotron Nano v2 - a 9B hybrid SSM that is 6X faster than similarly sized models, while also being more accurate.

Along with this model, we are also releasing most of the data we used to create it, including the pretraining corpus.

Links to the
Adina Williams (@adinamwilliams)'s Twitter Profile Photo

Awesome #MLRC2025 talks kicking us off this morning! I'm learning lots from The ML Reproducibility Challenge about science with ML and reproducibility for real-world applications (Arvind Narayanan), and software/firmware and data concerns for reproducibility (Soumith Chintala). Slides coming soon!
Koustuv Sinha (@koustuvsinha)'s Twitter Profile Photo

It's a wrap - the first in-person event for #MLRC2025 successfully concluded yesterday - we witnessed some of the best talks I have ever heard on reproducibility issues in AI, ranging from issues regarding leakage and irreproducibility in ML-based science (Arvind Narayanan),
Dan Jurafsky (@jurafsky)'s Twitter Profile Photo

Now that school is starting for lots of folks, it's time for a new release of Speech and Language Processing! Jim and I added all sorts of material for the August 2025 release! With slides to match! Check it out here: web.stanford.edu/~jurafsky/slp3/

Edward Grefenstette (@egrefen)'s Twitter Profile Photo

On the contrary, when history remembers people who stood in the face of unbridled hype constructively and in line with the scientific method, let David be thus-remembered over the perpetual goalpost-moving Garys of the world.

Koustuv Sinha (@koustuvsinha)'s Twitter Profile Photo

Quality > Quantity. I’d love to see efforts to further reduce the size of pretraining data while keeping the downstream evals constant. BabyLM was a step in this direction, but the focus was to design the right architecture. We need a GPT speedrun equivalent of this, where the

Bodhisattwa Majumder (@mbodhisattwa)'s Twitter Profile Photo

So happy to share what we have been working on for the past 2 years at Ai2. Data-driven discovery sits at the core of Asta, along with amazing literature discovery tools! You probably have read our works, now see them in action 🛼🛼

Arvind Narayanan (@random_walker)'s Twitter Profile Photo

I’m excited to announce I’ve started a YouTube channel. I plan to publish videos regularly explaining my views on AI and its present and future impacts. 

My first video asks: What happens if there’s an AI crash?
youtube.com/watch?v=VDfyuB…

This is my first foray into video (beyond
Shiwei Liu (@shiwei_liu66)'s Twitter Profile Photo

Happy to share a side project: Diffusion Language Models Know the Answer Before Decoding. 

Diffusion LMs are often dismissed as slow. But what if they already *know* the answer halfway through?   

1. Early Answer Convergence:
Our new paper shows that in many cases, they do,
Arian Hosseini (@ariantbd)'s Twitter Profile Photo

LLMs are great at single-shot problems, but in the era of experience, interactive environments are key 🔑 Introducing * Multi-Turn Puzzles (MTP) * , a new benchmark to test multi-turn reasoning and strategizing 🔗 Paper: huggingface.co/papers/2508.10… 🫙Data: huggingface.co/datasets/arian…

Jakob Foerster (@j_foerst)'s Twitter Profile Photo

Super excited about this event! I will give an updated version of my talk on the Simulation Hypothesis - i.e. Machine Learning in the upcoming era of extremely fast computers. How can we do science that stands the test of time when compute capacity is accelerating?