
Jad Kabbara
@jad_kabbara
NLP Postdoc @MIT Center for Constructive Communication (CCC). PhD from McGill University @rllabmcgill & @Mila_Quebec. @AUB_Lebanon alum.
ID: 847162226914013185
http://www.mit.edu/~jkabbara/ 29-03-2017 19:03:09
1,1K Tweet
1,1K Followers
745 Following

Excited to release my first lead project Magentic-UI at Microsoft Research, an OS web agent application designed for efficient human-agent interaction. CUA agents are cool but they're not so useful yet, Magentic-UI helps us study how to get value from them. github.com/microsoft/mageβ¦

π¨ Lucie-AimΓ©e Kaffee and I are looking for a junior collaborator to research the Open Model Ecosystem! π€ Ideally, someone w/ AI/ML background, who can help w/ annotation pipeline + analysis. docs.google.com/forms/d/e/1FAIβ¦




Excited to share the results of my internship research with AI at Meta, as part of a larger world modeling release! What subtle shortcuts are VideoLLMs taking on spatio-temporal questions? And how can we instead curate shortcut-robust examples at a large-scale? Details ππ¬



Thrilled to collaborate on the launch of π CommonPile v0.1 π ! Introducing the largest openly-licensed LLM pretraining corpus (8 TB), led by Nikhil Kandpal Brian Lester Colin Raffel. π: arxiv.org/pdf/2506.05209 ππ€ Data & models: huggingface.co/common-pile 1/


1) Thrilled to be at #Facct2025 for the first time this week, representing a meta-research paper on positionality statements at FAccT from 2018-2024, in collaboration with Solon Barocas (Solon Barocas) and Akshansh Pareek.

Theory of Mind (ToM) is crucial for next gen LLM Agents, yet current benchmarks suffer from multiple shortcomings. Enter π½ Decrypto, an interactive benchmark for multi-agent reasoning and ToM in LLMs! Work done with Timon Willi & Jakob Foerster at AI at Meta & Foerster Lab for AI Research π§΅π

A blizzard is raging in Montreal when your friend says βWow, the weather is amazing!β Humans easily interpret irony, while LLMs struggle at it. We propose a π³π©π¦π΅π°π³πͺπ€π’π-π΄π΅π³π’π΅π¦π¨πΊ-π’πΈπ’π³π¦ probabilistic framework as a solution. arxiv.org/abs/2506.09301 @ #acl2025




Existing AI Agent benchmarks are broken π€π Great work by Yuxuan Zhu and Daniel Kang identify + fix issues, and establish rigorous best practices for Agentic AI benchmarks! Check out the blog: ddkang.substack.com/p/ai-agent-benβ¦


πPersonal update: I'm thrilled to announce that I'm joining Imperial College London Imperial College London as an Assistant Professor of Computing Imperial Computing starting January 2026. My future lab and I will continue to work on building better Generative Models π€, the hardest


