Yaniv Nikankin (@ynikankin) 's Twitter Profile
Yaniv Nikankin

@ynikankin

PhD student @TechnionLive, looking inside language models

ID: 1479524606885175306

linkhttp://yaniv.nikankin.com calendar_today07-01-2022 18:45:57

34 Tweet

85 Followers

309 Following

Aaron Mueller (@amuuueller) 's Twitter Profile Photo

I'm recruiting PhD students for our new lab, coming to Boston University in Fall 2025! Our lab aims to understand, improve, and precisely control how language is learned and used in natural language systems (such as language models). Details below!

I'm recruiting PhD students for our new lab, coming to Boston University in Fall 2025!

Our lab aims to understand, improve, and precisely control how language is learned and used in natural language systems (such as language models).

Details below!
Jonas Geiping (@jonasgeiping) 's Twitter Profile Photo

Ok, so I can finally talk about this! We spent the last year (actually a bit longer) training an LLM with recurrent depth at scale. The model has an internal latent space in which it can adaptively spend more compute to think longer. I think the tech report ...🐦‍⬛

Ok, so I can finally talk about this! 

We spent the last year (actually  a bit longer) training an  LLM with recurrent depth at scale.

The model has an internal latent space in which it can adaptively spend more compute to think longer. 

I think the tech report ...🐦‍⬛
Ai2 (@allen_ai) 's Twitter Profile Photo

Meet Ai2 Paper Finder, an LLM-powered literature search system. Searching for relevant work is a multi-step process that requires iteration. Paper Finder mimics this workflow — and helps researchers find more papers than ever 🔍

Meet Ai2 Paper Finder, an LLM-powered literature search system.

Searching for relevant work is a multi-step process that requires iteration. Paper Finder mimics this workflow — and helps researchers find more papers than ever 🔍
Yaniv Nikankin (@ynikankin) 's Twitter Profile Photo

Interested in mechanistic interpretability? We'll be presenting our work on arithmetic mechanisms in LLMs later this week at #ICLR2025. DM me if you're there and want to chat about AI interpretability. 📆Friday, April 25th, 10-12:30 (Poster #243) 🔖iclr.cc/virtual/2025/p…

Interested in mechanistic interpretability? We'll be presenting our work on arithmetic mechanisms in LLMs later this week at #ICLR2025.
DM me if you're there and want to chat about AI interpretability.
📆Friday, April 25th, 10-12:30 (Poster #243)
🔖iclr.cc/virtual/2025/p…
Dana Arad 🎗️ (@dana_arad4) 's Twitter Profile Photo

Tried steering with SAEs and found that not all features behave as expected? Check out our new preprint - "SAEs Are Good for Steering - If You Select the Right Features" 🧵

Tried steering with SAEs and found that not all features behave as expected?

Check out our new preprint - "SAEs Are Good for Steering - If You Select the Right Features"  🧵
Michael Hanna (@michaelwhanna) 's Twitter Profile Photo

Mateusz and I are excited to announce circuit-tracer, a library that makes circuit-finding simple! Just type in a sentence, and get out a circuit showing (some of) the features your model uses to predict the next token. Try it on neuronpedia: shorturl.at/SUX2A

<a href="/mntssys/">Mateusz</a> and I are excited to announce circuit-tracer, a library that makes circuit-finding simple!

Just type in a sentence, and get out a circuit showing (some of) the features your model uses to predict the next token. Try it on <a href="/neuronpedia/">neuronpedia</a>: shorturl.at/SUX2A
Zorik Gekhman (@zorikgekhman) 's Twitter Profile Photo

Now accepted to #COLM2025! We formally define hidden knowledge in LLMs and show its existence in a controlled study. We even show that a model can know the answer yet fail to generate it in 1,000 attempts 😵 Looking forward to presenting and discussing our work in person.