Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile
Ekdeep Singh Lubana

@ekdeepl

Postdoc at CBS-NTT Program on Physics of Intelligence, Harvard University.

ID: 944451685711273984

Link: http://ekdeepslubana.github.io · Joined: 23-12-2017 06:16:43

445 Tweets

1.1K Followers

1.1K Following

Andrew Lee (@a_jy_l) 's Twitter Profile Photo

New Preprint! Did you know that steering vectors from one LM can be transferred and re-used in another LM? We argue this is because token embeddings across LMs share many “global” and “local” geometric similarities! 1/N

Kempner Institute at Harvard University (@kempnerinst) 's Twitter Profile Photo

New in the Deeper Learning blog: Kempner researchers characterize the inherent bias of sparse autoencoders and call for a new generation of SAEs that are aware of concept geometry. kempnerinstitute.harvard.edu/research/deepe… by Sumedh Hindupur, Ekdeep Singh, Thomas Fel, (Dem + 1) x Ba #AI #autoencoders #ML

Laura Ruis (@lauraruis) 's Twitter Profile Photo

Excited to announce that this fall I'll be joining Jacob Andreas's amazing lab at MIT for a postdoc to work on interp. for reasoning (with Ev (like in 'evidence', not Eve) Fedorenko 🇺🇦 🤯 among others). Cannot wait to think more about this direction in such a dream academic context!

Andrew Lee (@a_jy_l) 's Twitter Profile Photo

🚨New preprint! How do reasoning models verify their own CoT? We reverse-engineer LMs and find critical components and subspaces needed for self-verification! 1/n

Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

Check out the thread on our recent ICML paper, which uses knowledge graphs to mechanistically study how model editing can degrade a neural network's capabilities!