Noam Dahan (@dahan_noam) Twitter Tweets • TwiCopy

New preprint! ✨ Interested in LLM-as-a-Judge? Want to get the best judge for ranking your system? Our new work is just for you: "JuStRank: Benchmarking LLM Judges for System Ranking" 🕺💃 arxiv.org/abs/2412.09569

Eliya Habba

@eliyahabba

6 months ago

🌍 AI is changing the world. Is AI regulation on the right track? 🤔 While regulators rely on benchmarking 📊, we show why it cannot guarantee AI behavior: arxiv.org/pdf/2501.15693 Excited about this multidisciplinary collaboration! Gabriel Stanovsky, Renana Keydar, Gadi Perl

Gallil Maimon

@gallilmaimon

5 months ago

🗣️🧠 Speech Language Models require lots of compute to train, right? In our new paper, we test whether it is possible to train an SLM on 1xA5000 GPU in 24 hours. The results may surprise you (they even surprised us)! Tips, open source resources, full paper 👇🏻

Eliya Habba

@eliyahabba

4 months ago

Care about LLM evaluation? 🤖 🤔 We bring you 🕊️ DOVE, a massive (250M!) collection of LLM outputs on different prompts, domains, tokens, models... Join our community effort to expand it with YOUR model predictions & become a co-author!

Noam Dahan

@dahan_noam

3 months ago

ohhh cool work! love the grounding of NLP in cs theory. seems like automata are useful for analyzing the complexity of reasoning problems

Noam Dahan

@dahan_noam

3 months ago

Already in ABQ for NAACL2025 to talk about summarization datasets and green chile arxiv.org/abs/2411.04585

Noam Dahan

@dahan_noam

3 months ago

3k+ submissions to NAACL25! Resources/Eval is now the largest track

HUJI NLP

@nlphuji

3 months ago

We're at #NAACL2025! Presenting: 📍Cross-Lingual and Cross-Cultural Variation in Image Descriptions Thu May 1, 5:00 PM Ruidoso 📍The State and Fate of Summarization Datasets: A Survey Fri May 2, 12:00 PM Ruidoso Uri Berger, Shachar Don-Yehiya, Noam Dahan

Noam Dahan

@dahan_noam

3 months ago

Tomorrow I'll present "The State and Fate of Summarization Datasets" at #NAACL2025! I'll cover gaps in terminology, discoverability and multilingual coverage across 130+ datasets in 104 languages, and share how our work can help navigate this space. 🗓️Fri May 2, 12:00 PM Ruidoso

Noam Dahan

@dahan_noam

3 months ago

Join me at 12:00 in Ruidoso! #NAACL2025

Noam Dahan

@dahan_noam

3 months ago

My glass half full: finally washed the dishes Overleaf

Noam Dahan

@dahan_noam

2 months ago

Wow Dove is an extremely rich updating resource! Currently contains 250M prompt perturbations and model outputs on popular benchmarks. Congrats Eliya Habba!

Noy Sternlicht

@noysternlicht

2 months ago

🚨 New paper! We present CHIMERA — a KB of 28K+ scientific idea recombinations 💡 It captures how researchers blend concepts or take inspiration across fields, enabling: 1. Meta-science 2. Training models to predict new combos noy-sternlicht.github.io/CHIMERA-Web 👇 Findings & data:

Noam Dahan

@dahan_noam

2 months ago

Tiny milestone🥲: Over a hundred people have used our platform to explore summarization datasets across 104 languages. Check it out if you're looking for data: github.com/edahanoam/Awes…

Nitay Calderon

@nitcal

2 months ago

Preferences drive modern LLM research and development: from model alignment to evaluation. But how well do we understand them? Excited to share our new preprint: Multi-domain Explainability of Preferences arxiv.org/abs/2505.20088 Roi Reichart Liat 🧵👇 1/11
