Jay Alammar (@jayalammar)'s Twitter Profile
Jay Alammar

@jayalammar

Writer jalammar.github.io. O'Reilly Author LLM-book.com. LLM Builder @Cohere. Visualizing AI one concept at a time.

ID: 1245260977626587136

Link: https://www.youtube.com/channel/UCmOwsoHty5PrmE-3QhUBfPQ
Joined: 01-04-2020 08:05:51

1,1K Tweets

43,43K Followers

1,1K Following

wh (@nrehiew_)

Cohere's Command A report is an extremely extensive paper on how to train a modern LLM in 2025. But it's a model for very different but specific use cases. Let's talk about it

Matthias Gallé (@mgalle)

A year ago we released LBBP, a drop-in replacement of HumanEval that was more challenging and less leaked. Internally we have been using the multilingual version of this for benchmarking, and as code is not only Python, we decided to release that as well: huggingface.co/datasets/Coher…

Stephanie Chan (@scychan_brains)

Some years ago, I got trapped in a Massive Trough of Imposter Syndrome. It took more than a year to dig myself out of it, but the following framework really helped me. It feels a bit vulnerable to share, but I hope it might help a few others too! A short thread 🧵🙂

Metro Richmond Zoo (@metrorhmdzoo)

Patrick turns 34, receives a royal cloak, and then ties the perfect knot - because even jungle royalty needs a signature look! 👑🦧✨ King behavior. #metrorichmondzoo #rva #orangutans

Jay Alammar (@jayalammar)

Truly excited to see Suhas Pai's book go out into the world. I've had the pleasure of speaking with Suhas for hours and hours over the last couple of years and debating where LLMs are and where they would go. Certainly check it out! It's full of hard-won knowledge.

Graham Neubig (@gneubig)

Some people have said that OpenAI achieved state of the art results on the SWE-Bench Verified leaderboard with their codex model, but that's actually not quite correct, no matter how you measure it. A quick 🧵

Ash Vardanian (@ashvardanian)

Why not just use CONTRIBUTING.md?! That’s where one already puts code-style guides, relevant references, and compilation snippets…

Command A(idan) (@aidangomez)

Very proud to announce Cohere’s partnerships with Canada and Britain! Thank you Prime Ministers Carney and Starmer for your vision and leadership. 🇨🇦 ❤️ 🇬🇧

Stas Bekman (@stasbekman)

My first project at Snowflake AI Research is complete!

I present to you Arctic Long Sequence Training (ALST)

Paper: arxiv.org/abs/2506.13996
Blog: snowflake.com/en/engineering…

ALST is a set of modular, open-source techniques that enable training on sequences up to 15 million