Nora Kassner (@kassnernora) Twitter Tweets • TwiCopy

AI for Global Goals

a year ago

📢❗ After the resounding success of #OxML summer schools, we proudly present a pioneering new course on Generative AI at The London School of Economics and Political Science (LSE), this month!🎓💡 This intensive program will equip participants with cutting-edge knowledge

thumb_up_off_alt13

chat_bubble_outline1

repeat5

shareShare

Sohee Yang

@soheeyang_

a year ago

Our paper "Do Large Language Models Latently Perform Multi-Hop Reasoning?" will be presented at #ACL2024 today. 📍 Mon 14:00-15:30 Poster Session 2 (Conv. Center A1) Please visit our poster if you are interested, and catch me to chat about the latent reasoning ability of LLMs!

thumb_up_off_alt103

chat_bubble_outline3

repeat14

shareShare

Sohee Yang

@soheeyang_

a year ago

🚨 New Paper 🚨 Can LLMs perform latent multi-hop reasoning without exploiting shortcuts? We find the answer is yes – they can recall and compose facts not seen together in training or guessing the answer, but success greatly depends on the type of the bridge entity (80%+ for

thumb_up_off_alt192

chat_bubble_outline7

repeat46

shareShare

Sebastian Riedel (@[email protected])

@riedelcastro

a year ago

Frontier models can do this stuff, but also not! Opinions differ on how much we even want this (CC Geoffrey Irving), but understanding the patterns will be critical regardless. Been a pleasure to work with Latent Reasoning Dream Team Sohee Yang Mor Geva Nora Kassner!

thumb_up_off_alt27

chat_bubble_outline0

repeat6

shareShare

Edoardo Ponti

@pontiedoardo

a year ago

We had a blast at this year's ELLIS Schloss Dagstuhl seminar on "Modular and Agentive LLMs". Thanks everyone for participating!

We had a blast at this year's <a href="/ELLISforEurope/">ELLIS</a> <a href="/dagstuhl/">Schloss Dagstuhl</a> seminar on "Modular and Agentive LLMs".

Thanks everyone for participating!

thumb_up_off_alt57

chat_bubble_outline1

repeat9

shareShare

Pasquale Minervini is hiring postdocs! 🚀

@pminervini

a year ago

Sohee (Sohee Yang) in the house! 🚀🚀🚀🚀

Sohee (<a href="/soheeyang_/">Sohee Yang</a>) in the house! 🚀🚀🚀🚀

thumb_up_off_alt23

chat_bubble_outline0

repeat6

shareShare

Aida Nematzadeh 🦋

@aidanematzadeh

a year ago

I am hiring for RS/RE positions! If you are interested in language-flavored multimodal learning, evaluation, or post-training apply here 🦎 boards.greenhouse.io/deepmind/jobs/… I will also be #NeurIPS2024 so come say hi! (Please email me to find time to chat)

thumb_up_off_alt211

chat_bubble_outline4

repeat44

shareShare

Google DeepMind

@googledeepmind

a year ago

Welcome to the world, Gemini 2.0 ✨ our most capable AI model yet. We're first releasing an experimental version of 2.0 Flash ⚡ It has better performance, new multimodal output, Google tool use - and paves the way for new agentic experiences. 🧵 goo.gle/gemini-2

thumb_up_off_alt1,1K

chat_bubble_outline74

repeat433

shareShare

Shrestha Basu Mallick

@shresbm

a year ago

The Gemini 2.0 era begins with 2.0 Flash Experimental release ⚡️ 📈2.0 Flash beats 1.5 Pro across factuality, reasoning, coding, math. 📳 More modalities - image and audio out (in EAP) 🔧 Native tool use for Google Search, code execution and 3P functions 🆕 a new multimodal,

thumb_up_off_alt187

chat_bubble_outline6

repeat13

shareShare

Alexandra Chronopoulou

@alexandraxron

10 months ago

We are organizing Repl4NLP 2025 along with Freda Shi Giorgos Vernikos Vaibhav Adlakha Xiang Lorraine Li Bodhisattwa Majumder. The workshop will be co-located with NAACL 2025 in Albuquerque, New Mexico and we plan to have a great panel of speakers. Consider submitting your coolest work!

thumb_up_off_alt23

chat_bubble_outline0

repeat7

shareShare

Sohee Yang

@soheeyang_

8 months ago

Excited to share that the code and datasets for our papers on latent multi-hop reasoning are finally available on GitHub: github.com/google-deepmin… We hope these resources support further research in this area. Thanks for your patience as we worked through the release process!

thumb_up_off_alt403

chat_bubble_outline5

repeat72

shareShare

Sian Gooding

@siangooding

7 months ago

Google DeepMind Edward Grefenstette 🥳We have had a lot of interest in the role and are now asking potential candidates to fill out this form. If we are going forward with a referral, you will hear from us! forms.gle/8Y4oEvdGLZmmo1… Thanks again!

thumb_up_off_alt12

chat_bubble_outline0

repeat2

shareShare

Sundar Pichai

@sundarpichai

5 months ago

Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads lmarena.ai with a 24pt Elo score jump since the previous version. We also

thumb_up_off_alt4,4K

chat_bubble_outline214

repeat468

shareShare

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

5 months ago

How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts? "We show that models are effective at identifying most unhelpful thoughts but struggle to recover from the same thoughts when these are injected into their thinking process, causing significant

thumb_up_off_alt114

chat_bubble_outline4

repeat13

shareShare

Sohee Yang

@soheeyang_

5 months ago

🚨 New Paper 🧵 How effectively do reasoning models reevaluate their thought? We find that: - Models excel at identifying unhelpful thoughts but struggle to recover from them - Smaller models can be more robust - Self-reevaluation ability is far from true meta-cognitive awareness

thumb_up_off_alt103

chat_bubble_outline3

repeat24

shareShare

Logan Kilpatrick

@officiallogank

5 months ago

The progress of Gemini over the last year +

thumb_up_off_alt1,1K

chat_bubble_outline153

repeat139

shareShare

Partha Talukdar

@partha_p_t

4 months ago

Google DeepMind India 🇮🇳 & Japan 🇯🇵 are looking for strong candidates in multilinguality, multicultural, & multimodality areas. RS Bangalore: job-boards.greenhouse.io/deepmind/jobs/… RS Tokyo: job-boards.greenhouse.io/deepmind/jobs/… RE Tokyo: job-boards.greenhouse.io/deepmind/jobs/…

thumb_up_off_alt154

chat_bubble_outline2

repeat24

shareShare

Mor Geva

@megamor2

4 months ago

📍2025-07-28 18:00 - 19:30 Hall 4/5 (and GEM workshop) Sohee Yang will present the results of our investigation at Google DeepMind on whether LLMs can perform latent multi-hop reasoning without exploiting shortcuts x.com/soheeyang_/sta… Nora Kassner Elena Gribovskaya Sebastian Riedel (@[email protected])

thumb_up_off_alt6

chat_bubble_outline1

repeat2

shareShare

Sohee Yang

@soheeyang_

4 months ago

Our paper "Do Large Language Models Perform Latent Multi-Hop Reasoning without exploiting shortcuts?" will be presented at #ACL2025 today. 📍 Mon 18:00-19:30 Findings Posters (Hall X4 X5) Please visit our poster if you are interested! ✨

thumb_up_off_alt72

chat_bubble_outline0

repeat10

shareShare

Yanai Elazar

@yanaiela

2 months ago

Organizing a workshop? Checkout our compiled material for organizing one: bigpictureworkshop.com/open-workshop (and hopefully we'll be back for another iteration of the Big Picture next year Allyson Ettinger, Nora Kassner, Sebastian Ruder @ ACL)

thumb_up_off_alt24

chat_bubble_outline0

repeat2

shareShare