Hanna Hajishirzi (@hannahajishirzi) 's Twitter Profile
Hanna Hajishirzi

@hannahajishirzi

Sr. Director at @allen_ai, Prof at @uw_cse; co-lead OLMo, Tulu; AI/NLP/ML researcher at @uw_nlp

ID: 1098819287446253569

Website: https://homes.cs.washington.edu/~hannaneh/ · Joined: 22-02-2019 05:38:27

661 Tweets

7.7K Followers

397 Following

Ai2 (@allen_ai) 's Twitter Profile Photo

We are #1 on the Hugging Face heatmap - this is what true openness looks like! 🥇🎉
750+ models
230+ datasets
And counting... Come build with us: huggingface.co/spaces/cfahlgr…

Hanna Hajishirzi (@hannahajishirzi) 's Twitter Profile Photo

Yayyy!!! Best paper honorable mention at CVPR goes to our Molmo and Pixmo at Ai2! This is now becoming a trend :) Last year, both OLMo and Dolma received best paper awards at ACL.

Ai2 (@allen_ai) 's Twitter Profile Photo

New updates for olmOCR, our fully open toolkit for transforming documents (PDFs & images) into clean markdown. We released:

1️⃣ New benchmark for fair comparison of OCR engines and APIs
2️⃣ Improved inference that is faster and cheaper to run
3️⃣ Docker image for easy deployment
Ai2 (@allen_ai) 's Twitter Profile Photo

How well can today’s models generalize, compose, or even innovate on unseen problems? OMEGA Ω is a new math benchmark that pushes LLMs beyond pattern-matching to test true mathematical reasoning. ⚡ allenai.org/blog/omega

Joongwon Kim (@danieljwkim) 's Twitter Profile Photo

Can we improve Llama 3’s reasoning abilities through post-training only? Introducing ASTRO, our new framework that teaches LLMs to perform in-context search and generate long CoT to solve math problems, via SFT and RL. Work done at @aiatmeta. 📄 Paper: arxiv.org/abs/2507.00417

Victoria Graf (@victoriawgraf) 's Twitter Profile Photo

Worried about overfitting to IFEval? 🤔 Use ✨IFBench✨, our new, challenging instruction-following benchmark! Loved working w/ Valentina Pyatkin! Personal highlight: our multi-turn eval setting makes it possible to isolate constraint-following from the rest of the instruction 🔍
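The multi-turn isolation idea can be sketched as a toy harness (a hypothetical example with made-up constraints, not IFBench's actual constraint set or API): the task arrives in turn 1 and the constraint alone in turn 2, so the checker scores constraint-following independently of task quality.

```python
def check_constraint(response: str, constraint: str) -> bool:
    """Toy verifier for a single output constraint (hypothetical
    constraint names, for illustration only)."""
    if constraint == "all_lowercase":
        return response == response.lower()
    if constraint == "max_three_words":
        return len(response.split()) <= 3
    raise ValueError(f"unknown constraint: {constraint}")

# Multi-turn framing: turn 1 carries the task, turn 2 adds only the
# constraint, so the constraint check is independent of the task.
dialogue = [
    {"role": "user", "content": "Name a primary color."},
    {"role": "assistant", "content": "Blue is a primary color."},
    {"role": "user", "content": "Repeat your answer in all lowercase."},
]
model_reply = "blue is a primary color."
assert check_constraint(model_reply, "all_lowercase")
```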

Cirrascale Cloud Services (@cirrascale) 's Twitter Profile Photo

🚨 Just announced: OLMo, Molmo & Tülu are now LIVE on the Cirrascale Inference Platform!

It’s official, Cirrascale is the first to offer commercial inference endpoints for @Ai2’s OLMo, Molmo & Tülu models on our Inference Platform.

Our Inference Platform provides a fully open,
Zhiyuan Zeng (@zhiyuanzeng_) 's Twitter Profile Photo

EvalTree accepted to Conference on Language Modeling 2025 - my first PhD work and first COLM paper 🙌! What would you like to see next—extensions, applications, or other directions? Always open to ideas! 🧐

Weijia Shi (@weijiashi2) 's Twitter Profile Photo

Can data owners & LM developers collaborate to build a strong shared model while each retains data control? Introducing FlexOlmo 💪, a mixture-of-experts LM enabling:
• Flexible training on your local data without sharing it
• Flexible inference to opt your data in or out
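The opt-in/opt-out behavior can be illustrated with a toy mixture-of-experts forward pass (a minimal sketch with scalar "experts", not FlexOlmo's actual architecture): an opted-out expert gets zero routing weight and the remaining weights are renormalized, so a data owner's contribution can be removed at inference time without retraining.

```python
import math

def moe_forward(x, experts, router_logits, active):
    """Toy 1-D mixture of experts: each expert is a scalar multiplier.
    Opted-out experts (active[i] == False) get zero routing weight;
    the rest are renormalized via a masked softmax."""
    weights = [math.exp(l) if a else 0.0 for l, a in zip(router_logits, active)]
    total = sum(weights)
    probs = [w / total for w in weights]
    return sum(p * e * x for p, e in zip(probs, experts))

experts = [2.0, -1.0, 0.5]   # one expert per data owner
logits  = [0.0,  0.0, 0.0]   # uniform router, for clarity

full    = moe_forward(1.0, experts, logits, [True, True, True])   # (2 - 1 + 0.5)/3 = 0.5
revoked = moe_forward(1.0, experts, logits, [True, False, True])  # (2 + 0.5)/2 = 1.25
assert full != revoked  # revoking one owner's expert changes output, no retraining
```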

Niklas Muennighoff (@muennighoff) 's Twitter Profile Photo

Sparse Mixture-of-Expert LLMs to opt data in & out on the fly — I think a compelling vision for a future where AI developers & publishers work together rather than filing lawsuits🙂

Sewon Min (@sewon__min) 's Twitter Profile Photo

It has been great working on the project with support from Ai2! I believe there are many meaningful ways different people and orgs can work together to build strong shared models, and data collaboration might be the most impactful form of it. 📄Paper:

Will Knight (@willknight) 's Twitter Profile Photo

New on WIRED: A novel type of distributed mixture-of-experts model from Ai2 (called FlexOlmo) allows data to be contributed to a frontier model confidentially, and even revoked after the model is built: wired.com/story/flexolmo…