Eran Malach (@eranmalach) Twitter Tweets • TwiCopy

Eran Malach

@eranmalach

a year ago

Will be presenting this work at #NeurIPS2024, today 11am, poster #2311. Come visit us!

Likes: 10 · Replies: 1 · Retweets: 5

Eran Malach

@eranmalach

a year ago

Presenting this work at #NeurIPS2024 today 4:30pm session (poster #4807, east). Come by to hear about auto-regressive decision trees for language modeling!

Likes: 7 · Replies: 0 · Retweets: 0

Kempner Institute at Harvard University

@kempnerinst

a year ago

The 4:30pm poster session today at #NeurIPS2024 will feature "The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains," by Ezra Edelman, Nikolaos Tsilivis, Surbhi Goel, Ben Edelman and Eran Malach. #KempnerInstitute

Likes: 10 · Replies: 1 · Retweets: 4

Rana Shahout

@rana_shahout

10 months ago

New paper at #ICLR2025! Fast LLM inference = smart scheduling 🕒 but size-based scheduling (prioritizing short requests over long ones) requires knowing request sizes, a challenging task in LLM systems. So, how can we predict request sizes accurately? 🔗 openreview.net/forum?id=7JhGd…

Likes: 64 · Replies: 2 · Retweets: 15

Eran Malach

@eranmalach

7 months ago

To backtrack or not to backtrack? The answer depends on the nature of the reasoning problem! Check out our new paper, led by Sunny Qin, with David Alvarez Melis and Samy Jelassi: arxiv.org/abs/2504.07052 See thread below 👇👇

Likes: 5 · Replies: 0 · Retweets: 0

Nathan Lambert

@natolambert

7 months ago

The best part of RL's focus in post-training right now is that the elicitation idea of post-training is a much better match to large-scale pretraining. Instruction and preference tuning are still crucial to good models, in tasteful amounts. More research on this:

Likes: 322 · Replies: 8 · Retweets: 32

Kempner Institute at Harvard University

@kempnerinst

7 months ago

New in the Deeper Learning blog: Kempner researchers dive into how LLMs reason with the help of "backtracking," and explore alternative ways to enhance LLM reasoning capabilities. bit.ly/KempnerBacktra… #AI #ML #LLMs By Sunny Qin, David Alvarez Melis, Samy Jelassi, and Eran Malach

Likes: 8 · Replies: 0 · Retweets: 2

Kempner Institute at Harvard University

@kempnerinst

7 months ago

New in the Deeper Learning blog: Kempner researchers show how VLMs speak the same semantic language across images and text. bit.ly/KempnerVLM by Isabel Papadimitriou, Chloe H. Su, Thomas Fel, Stephanie Gil, and Sham Kakade #AI #ML #VLMs #SAEs

Likes: 23 · Replies: 0 · Retweets: 17

MOSS

@moss_workshop

7 months ago

Announcing the 1st Workshop on Methods and Opportunities at Small Scale (MOSS) at ICML Conference 2025! 🔗Website: sites.google.com/view/moss2025 📝 We welcome submissions! 📅 Paper & jupyter notebook deadline: May 22, 2025 Topics: – Inductive biases & generalization – Training

Likes: 42 · Replies: 0 · Retweets: 11

Surbhi Goel

@surbhigoel_

7 months ago

Super excited to announce our ICML workshop on highlighting the power (and limitations?) of small-scale in the era of large-scale ML. You can submit just a Jupyter notebook, Jupyter notebook + paper, or a survey/position paper. Do submit your work and help us spread the word!

Likes: 61 · Replies: 1 · Retweets: 13

Bingbin Liu

@bingbinl

7 months ago

Excited to announce MOSS, our ICML workshop focused on discoveries at small scale! We believe there's tremendous potential & creativity in research done with limited resources and would love to hear your ideas. The submission (due May 22nd) can literally be a Jupyter notebook! :)

Likes: 116 · Replies: 0 · Retweets: 12

MOSS

@moss_workshop

6 months ago

We are extending the deadline to May 26th 4:59pm PDT (11:59pm UTC). Thank you everyone for your interest & inquiries; we look forward to learning about your results! 🪄

Likes: 11 · Replies: 0 · Retweets: 7

Antonio Orvieto

@orvieto_antonio

6 months ago

We have a new SSM theory paper, just accepted to COLT, revisiting recall properties of linear RNNs. It's surprising how much one can delve into, and how beautiful it can become. With (and only thanks to) the amazing Alexandre and Francis Bach arxiv.org/pdf/2502.09287

Likes: 100 · Replies: 2 · Retweets: 26

Noah Golowich

@golowichnoah

4 months ago

I'll be attending ICML this week; come stop by our poster on length generalization in LLMs on Tuesday morning (poster session 1 west)! Paper link: openreview.net/forum?id=S9LkB…

Likes: 26 · Replies: 0 · Retweets: 4