Jad Kabbara (@jad_kabbara) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Excited to release my first lead project Magentic-UI at Microsoft Research, an OS web agent application designed for efficient human-agent interaction. CUA agents are cool but they're not so useful yet, Magentic-UI helps us study how to get value from them. github.com/microsoft/mage…

thumb_up_off_alt55

chat_bubble_outline1

repeat9

shareShare

Shayne Longpre

@shayneredford

2 months ago

🚨 Lucie-Aimée Kaffee and I are looking for a junior collaborator to research the Open Model Ecosystem! 🤖 Ideally, someone w/ AI/ML background, who can help w/ annotation pipeline + analysis. docs.google.com/forms/d/e/1FAI…

thumb_up_off_alt98

chat_bubble_outline4

repeat23

shareShare

Eric

@eric_chamoun

2 months ago

What are NLP papers really saying about the purpose and use of their models/datasets? 🤔 Who are they for? What problems do they solve? How are they used? We built a framework + tool to: (1) analyze framing trends across papers (2) help authors reflect on their own framing 🧵

thumb_up_off_alt29

chat_bubble_outline1

repeat7

shareShare

EleutherAI

@aieleuther

a month ago

Can you train a performant language models without using unlicensed text? We are thrilled to announce the Common Pile v0.1, an 8TB dataset of openly licensed and public domain text. We train 7B models for 1T and 2T tokens and match the performance similar models like LLaMA 1&2

thumb_up_off_alt556

chat_bubble_outline10

repeat127

shareShare

Ziling Cheng

@ziling_cheng

a month ago

Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts follow a systematic failure mode — revealing how LLMs generalize using abstract classes + context cues, albeit unreliably. 📎 Paper: arxiv.org/abs/2505.22630 1/n

thumb_up_off_alt34

chat_bubble_outline1

repeat20

shareShare

Benno Krojer

@benno_krojer

a month ago

Excited to share the results of my internship research with AI at Meta, as part of a larger world modeling release! What subtle shortcuts are VideoLLMs taking on spatio-temporal questions? And how can we instead curate shortcut-robust examples at a large-scale? Details 👇🔬

Excited to share the results of my internship research with <a href="/AIatMeta/">AI at Meta</a>, as part of a larger world modeling release!

What subtle shortcuts are VideoLLMs taking on spatio-temporal questions?

And how can we instead curate shortcut-robust examples at a large-scale?

Details 👇🔬

thumb_up_off_alt59

chat_bubble_outline3

repeat22

shareShare

Victor Sanh

@sanhestpasmoi

a month ago

🔥Big exciting news - I've started a new company! 🚀 We are building AI agents that take actions in the real world by orchestrating the movement of physical goods. We're working with our first partners and are now growing the founding engineering team. We're building in NYC,

thumb_up_off_alt337

chat_bubble_outline37

repeat34

shareShare

Shayne Longpre

@shayneredford

23 days ago

Thrilled to collaborate on the launch of 📚 CommonPile v0.1 📚 ! Introducing the largest openly-licensed LLM pretraining corpus (8 TB), led by Nikhil Kandpal Brian Lester Colin Raffel. 📜: arxiv.org/pdf/2506.05209 📚🤖 Data & models: huggingface.co/common-pile 1/

Thrilled to collaborate on the launch of 📚 CommonPile v0.1 📚 !

Introducing the largest openly-licensed LLM pretraining corpus (8 TB), led by <a href="/kandpal_nikhil/">Nikhil Kandpal</a> <a href="/blester125/">Brian Lester</a> <a href="/colinraffel/">Colin Raffel</a>.

📜: arxiv.org/pdf/2506.05209
📚🤖 Data & models: huggingface.co/common-pile
1/

thumb_up_off_alt58

chat_bubble_outline2

repeat14

shareShare

Hope Schroeder

@schropes

21 days ago

1) Thrilled to be at #Facct2025 for the first time this week, representing a meta-research paper on positionality statements at FAccT from 2018-2024, in collaboration with Solon Barocas (Solon Barocas) and Akshansh Pareek.

thumb_up_off_alt28

chat_bubble_outline1

repeat4

shareShare

Andrei Lupu

@_andreilupu

20 days ago

Theory of Mind (ToM) is crucial for next gen LLM Agents, yet current benchmarks suffer from multiple shortcomings. Enter 💽 Decrypto, an interactive benchmark for multi-agent reasoning and ToM in LLMs! Work done with Timon Willi & Jakob Foerster at AI at Meta & Foerster Lab for AI Research 🧵👇

thumb_up_off_alt101

chat_bubble_outline4

repeat30

shareShare

Cesare Spinoso-Di Piano

@cesare_spinoso

20 days ago

A blizzard is raging in Montreal when your friend says “Wow, the weather is amazing!” Humans easily interpret irony, while LLMs struggle at it. We propose a 𝘳𝘩𝘦𝘵𝘰𝘳𝘪𝘤𝘢𝘭-𝘴𝘵𝘳𝘢𝘵𝘦𝘨𝘺-𝘢𝘸𝘢𝘳𝘦 probabilistic framework as a solution. arxiv.org/abs/2506.09301 @ #acl2025

thumb_up_off_alt11

chat_bubble_outline1

repeat11

shareShare

Michiel Bakker

@bakkermichiel

14 days ago

🚨🚨 Excited to share a new paper led by Haiwen Li with the Community Notes team! LLMs will reshape the information ecosystem. Community Notes offers a promising model for keeping human judgment central but it's an open question how to best integrate LLMs. Thread👇

🚨🚨 Excited to share a new paper led by <a href="/Li_Haiwen_/">Haiwen Li</a> with the <a href="/CommunityNotes/">Community Notes</a> team!

LLMs will reshape the information ecosystem. Community Notes offers a promising model for keeping human judgment central but it's an open question how to best integrate LLMs.

Thread👇

thumb_up_off_alt144

chat_bubble_outline5

repeat31

shareShare

Bashar Alhafni

@balhafni

8 days ago

Extremely excited to share that I've joined the NLP department MBZUAI as a tenure-track Assistant Professor!

Extremely excited to share that I've joined the NLP department <a href="/mbzuai/">MBZUAI</a> as a tenure-track Assistant Professor!

thumb_up_off_alt139

chat_bubble_outline13

repeat4

shareShare

Shayne Longpre

@shayneredford

8 days ago

Existing AI Agent benchmarks are broken 🤖💔 Great work by Yuxuan Zhu and Daniel Kang identify + fix issues, and establish rigorous best practices for Agentic AI benchmarks! Check out the blog: ddkang.substack.com/p/ai-agent-ben…

Existing AI Agent benchmarks are broken 🤖💔

Great work by <a href="/maxYuxuanZhu/">Yuxuan Zhu</a> and <a href="/daniel_d_kang/">Daniel Kang</a> identify + fix issues, and establish rigorous best practices for Agentic AI benchmarks!

Check out the blog: ddkang.substack.com/p/ai-agent-ben…

thumb_up_off_alt115

chat_bubble_outline3

repeat20

shareShare

Joey Bose

@bose_joey

7 days ago

🎉Personal update: I'm thrilled to announce that I'm joining Imperial College London Imperial College London as an Assistant Professor of Computing Imperial Computing starting January 2026. My future lab and I will continue to work on building better Generative Models 🤖, the hardest

thumb_up_off_alt605

chat_bubble_outline97

repeat33

shareShare

Shayne Longpre

@shayneredford

2 days ago

Copyrighted 🚧, private 🛑, and sensitive ☢️ data remain major challenges for AI. FlexOlmo introduces an architectural mechanism to flexibly opt-in/opt-out segments of data in the training weights, **at inference time**. (Prior common solutions were to filter your data once

thumb_up_off_alt36

chat_bubble_outline1

repeat6

shareShare

Nicholas Meade

@ncmeade

a day ago

I'll be at #ICML2025 this week presenting SafeArena (Wednesday 11AM - 1:30PM in East Exhibition Hall E-701). Come by to chat with me about web agent safety (or anything else safety-related)!

thumb_up_off_alt30

chat_bubble_outline1

repeat10

shareShare

Siva Reddy

@sivareddyg

16 hours ago

I am speaking at 10 am PT on a slightly different topic than I usually talk about 🙂: "Simple Ideas Can Have Mighty Effects: Don't Take LLM Fundamentals for Granted" Check out if you're around. #ICML2025

thumb_up_off_alt26

chat_bubble_outline0

repeat4

shareShare

Shayne Longpre

@shayneredford

9 hours ago

Excited to present our AI Flaw Disclosure paper at #ICML2025 in Vancouver!🌲🌊🏔️ Swing by our poster session in East Exhibition Halls A-B E-606!

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Jad Kabbara

Gate.io

Hussein Mozannar

Shayne Longpre

Eric

EleutherAI

Ziling Cheng

Benno Krojer

Victor Sanh

Shayne Longpre

Hope Schroeder

Andrei Lupu

Cesare Spinoso-Di Piano

Michiel Bakker

Bashar Alhafni

Shayne Longpre

Joey Bose

Shayne Longpre

Nicholas Meade

Siva Reddy

Shayne Longpre