Yaniv Nikankin (@ynikankin) Twitter Tweets • TwiCopy

Yaniv Nikankin

@ynikankin


PhD student @TechnionLive, looking inside language models

ID: 1479524606885175306

Website: http://yaniv.nikankin.com

Joined: 07-01-2022 18:45:57

34 Tweets

85 Followers

309 Following


Aaron Mueller

I'm recruiting PhD students for our new lab, coming to Boston University in Fall 2025! Our lab aims to understand, improve, and precisely control how language is learned and used in natural language systems (such as language models). Details below!

736 likes · 11 replies · 190 retweets

Yossi Gandelsman

@ygandelsman

6 months ago

Check out the First Workshop on Mech Interp for Vision at #CVPR2025! Paper submissions: sites.google.com/view/miv-cvpr2…

38 likes · 0 replies · 5 retweets

Jonas Geiping

@jonasgeiping

6 months ago

Ok, so I can finally talk about this! We spent the last year (actually a bit longer) training an LLM with recurrent depth at scale. The model has an internal latent space in which it can adaptively spend more compute to think longer. I think the tech report ...🐦‍⬛

2.2K likes · 51 replies · 200 retweets

Ai2

@allen_ai

4 months ago

Meet Ai2 Paper Finder, an LLM-powered literature search system. Searching for relevant work is a multi-step process that requires iteration. Paper Finder mimics this workflow — and helps researchers find more papers than ever 🔍

1.1K likes · 19 replies · 220 retweets

Hadas Orgad

@orgadhadas

4 months ago

🎉 Our Actionable Interpretability workshop has been accepted to #ICML2025! 🎉
>> Follow Actionable Interpretability Workshop ICML2025
Tal Haklay, Anja Reusch, Marius Mosbach, Sarah Wiegreffe, Ian Tenney (@iftenney@sigmoid.social), Mor Geva
Paper submission deadline: May 9th!

127 likes · 1 reply · 25 retweets

Yaniv Nikankin

@ynikankin

3 months ago

Interested in mechanistic interpretability? We'll be presenting our work on arithmetic mechanisms in LLMs later this week at #ICLR2025. DM me if you're there and want to chat about AI interpretability. 📆Friday, April 25th, 10-12:30 (Poster #243) 🔖iclr.cc/virtual/2025/p…

83 likes · 1 reply · 10 retweets

Yaniv Nikankin

@ynikankin

3 months ago

Having a standard benchmark is important as mechinterp matures. Check out our work and come chat with us at @iclr2025!

9 likes · 0 replies · 0 retweets

Dana Arad 🎗️

@dana_arad4

2 months ago

Tried steering with SAEs and found that not all features behave as expected? Check out our new preprint - "SAEs Are Good for Steering - If You Select the Right Features" 🧵

166 likes · 7 replies · 32 retweets

Michael Hanna

@michaelwhanna

2 months ago

Mateusz and I are excited to announce circuit-tracer, a library that makes circuit-finding simple! Just type in a sentence, and get out a circuit showing (some of) the features your model uses to predict the next token. Try it on neuronpedia: shorturl.at/SUX2A


199 likes · 8 replies · 45 retweets

Zorik Gekhman

@zorikgekhman

19 days ago

Now accepted to #COLM2025! We formally define hidden knowledge in LLMs and show its existence in a controlled study. We even show that a model can know the answer yet fail to generate it in 1,000 attempts 😵 Looking forward to presenting and discussing our work in person.

58 likes · 1 reply · 14 retweets
