Noam Dahan (@dahan_noam)'s Twitter Profile
Noam Dahan

@dahan_noam

CS MSc student @nlphuji researching NLP | Former news editor @Haaretz

ID: 2340642482

12-02-2014 16:57:35

43 Tweets

284 Followers

270 Following

Asaf Yehudai (@asafyehudai)

New preprint! ✨ Interested in LLM-as-a-Judge? Want to get the best judge for ranking your system? Our new work is just for you: "JuStRank: Benchmarking LLM Judges for System Ranking" 🕺💃 arxiv.org/abs/2412.09569

Eliya Habba (@eliyahabba)

🌍 AI is changing the world. Is AI regulation on the right track? 🤔 While regulators rely on benchmarking 📊, we show why it cannot guarantee AI behavior: arxiv.org/pdf/2501.15693 Excited about this multidisciplinary collaboration! Gabriel Stanovsky, Renana Keydar, Gadi Perl

Gallil Maimon (@gallilmaimon)

🗣️🧠 Speech Language Models require lots of compute to train, right? In our new paper, we test: is it possible to train an SLM on 1xA5000 GPU in 24 hours? The results may surprise you (they even surprised us)! Tips, open source resources, full paper 👇🏻

Eliya Habba (@eliyahabba)

Care about LLM evaluation? 🤖 🤔 We bring you 🕊️ DOVE, a massive (250M!) collection of LLM outputs on different prompts, domains, tokens, models... Join our community effort to expand it with YOUR model predictions & become a co-author!

Noam Dahan (@dahan_noam)

ohhh cool work! love the grounding of NLP in cs theory. seems like automata are useful for analyzing the complexity of reasoning problems

HUJI NLP (@nlphuji)

We're at #NAACL2025! Presenting: 📍 Cross-Lingual and Cross-Cultural Variation in Image Descriptions, Thu May 1, 5:00 PM, Ruidoso 📍 The State and Fate of Summarization Datasets: A Survey, Fri May 2, 12:00 PM, Ruidoso. Uri Berger, Shachar Don-Yehiya, Noam Dahan

Noam Dahan (@dahan_noam)

Tomorrow I'll present "The State and Fate of Summarization Datasets" at #NAACL2025! I'll cover gaps in terminology, discoverability and multilingual coverage across 130+ datasets in 104 languages, and share how our work can help navigate this space. 🗓️Fri May 2, 12:00 PM Ruidoso

Noam Dahan (@dahan_noam)

Wow, DOVE is an extremely rich, continuously updated resource! It currently contains 250M prompt perturbations and model outputs on popular benchmarks. Congrats Eliya Habba!

Noy Sternlicht (@noysternlicht)

🚨 New paper! We present CHIMERA — a KB of 28K+ scientific idea recombinations 💡 It captures how researchers blend concepts or take inspiration across fields, enabling: 1. Meta-science 2. Training models to predict new combos noy-sternlicht.github.io/CHIMERA-Web 👇 Findings & data:

Noam Dahan (@dahan_noam)

Tiny milestone🥲: Over a hundred people have used our platform to explore summarization datasets across 104 languages. Check it out if you're looking for data: github.com/edahanoam/Awes…

Nitay Calderon (@nitcal)

Preferences drive modern LLM research and development: from model alignment to evaluation. But how well do we understand them? Excited to share our new preprint: Multi-domain Explainability of Preferences arxiv.org/abs/2505.20088 Roi Reichart Liat 🧵👇 1/11
