Mehul Damani @ ICLR (@mehuldamani2) 's Twitter Profile
Mehul Damani @ ICLR

@mehuldamani2

PhD Student at MIT | Reinforcement Learning, NLP

ID: 2784283957

Website: https://damanimehul.github.io/ | Joined: 01-09-2014 15:08:58

30 Tweets

227 Followers

274 Following

Linlu Qiu (@linluqiu) 's Twitter Profile Photo

It was a great pleasure working on this project with amazing collaborators! Excited to see more opportunities opened up by scaling test-time compute!

Ekin Akyürek (@akyurekekin) 's Twitter Profile Photo

Thanks for the attention. A couple of important points: 1) See Jack Cole: their team was the first to apply this method privately, and they took 1st place in the competition. 2) See the concurrent work as well: x.com/ellisk_kellis/… 3) Obviously this is not AGI, it's a

Noam Brown (@polynoamial) 's Twitter Profile Photo

With OpenAI o1, we developed one way to scale test-time compute, but it isn't the only way and might not be the best way. I'm excited to see academic researchers explore new approaches in this direction.

Seungwook Han (@seungwookh) 's Twitter Profile Photo

🧩 Why do task vectors exist in pretrained LLMs? Our new research uncovers how transformers form internal abstractions and the mechanisms behind in-context learning (ICL).

Isha Puri (@ishapuri101) 's Twitter Profile Photo

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint MIT CSAIL / Red Hat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io

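The tweet above only names the technique. As a loose, hypothetical illustration of the general idea (not the paper's actual method or API), a particle filter over partial generations maintains a population of candidates, weights them with a scoring function standing in for a reward model, and resamples toward higher-scoring candidates:

```python
import random

def particle_filter_search(extend, score, num_particles=8, num_steps=4, seed=0):
    """Toy sketch of particle filtering for inference-time scaling.

    extend(state, rng) -> state with one more step appended
    score(state)       -> nonnegative weight (reward-model stand-in)
    """
    rng = random.Random(seed)
    particles = [() for _ in range(num_particles)]  # empty partial solutions
    for _ in range(num_steps):
        # Propagate: extend each partial solution by one step.
        particles = [extend(p, rng) for p in particles]
        # Weight: score each partial solution.
        weights = [score(p) for p in particles]
        if sum(weights) == 0:
            continue
        # Resample: draw particles in proportion to their weights.
        particles = rng.choices(particles, weights=weights, k=num_particles)
    return max(particles, key=score)

# Hypothetical toy task: build a bit sequence maximizing the number of 1s.
def extend(state, rng):
    return state + (rng.choice([0, 1]),)

def score(state):
    return 1 + sum(state)  # stand-in for a learned reward model

best = particle_filter_search(extend, score)
```

Resampling concentrates compute on promising candidates without any gradient updates, which is the sense in which such approaches scale inference without training.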
Jeremy Bernstein (@jxbz) 's Twitter Profile Photo

I just wrote my first blog post in four years! It is called "Deriving Muon". It covers the theory that led to Muon and how, for me, Muon is a meaningful example of theory leading practice in deep learning (1/11)

idan shenfeld (@idanshenfeld) 's Twitter Profile Photo

The next frontier for AI shouldn’t just be generally helpful. It should be helpful for you! Our new paper shows how to personalize LLMs — efficiently, scalably, and without retraining. Meet PReF (arxiv.org/abs/2503.06358)

Mehul Damani @ ICLR (@mehuldamani2) 's Twitter Profile Photo

I am super excited to be presenting our work on adaptive inference-time compute at ICLR! Come chat with me on Thursday 4/24 at 3PM (Poster #219). I am also happy to chat about RL/reasoning/RLHF/inference scaling (DMs are open)!

Akarsh Kumar (@akarshkumar0101) 's Twitter Profile Photo

Excited to share our position paper on the Fractured Entangled Representation (FER) Hypothesis! We hypothesize that the standard paradigm of training networks today — while producing impressive benchmark results — is still failing to create a well-organized internal

Adam Zweiger (@adamzweiger) 's Twitter Profile Photo

Come check out our ICML poster on combining Test-Time Training and In-Context Learning for on-the-fly adaptation to novel tasks like ARC-AGI puzzles. I will be presenting with Jyo Pari at E-2702, Tuesday 11-1:30!

Jacob Andreas (@jacobandreas) 's Twitter Profile Photo

👉 New preprint! Today, many of the biggest challenges in LM post-training aren't just about correctness, but rather about consistency & coherence across interactions. This paper tackles some of these issues by optimizing reasoning LMs for calibration rather than accuracy...