Lukasz Staniszewski (@lukxst) Twitter Tweets • TwiCopy

Lukasz Staniszewski

@lukxst

+ Follow

ID: 1794472237644288000

calendar_today25-05-2024 20:55:06

4 Tweet

8 Followers

92 Following

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

🔥 New Paper! How can sparse autoencoders (SAEs) applied to diffusion models help us solve real-world challenges? 🚀 Introducing 𝗦𝗔𝗲𝗨𝗿𝗼𝗻: We use SAEs for unlearning in diffusion models and outperform existing baselines! Here's how it works: 🧵 1/

thumb_up_off_alt216

chat_bubble_outline4

repeat48

shareShare

Bartosz Cywiński

@bartoszcyw

5 months ago

🔥 New ICLR 2025 Paper! It would be cool to control the content of text generated by diffusion models with less than 1% of parameters, right? And how about doing it across diverse architectures and within various applications? 🚀 🫡 Together with Lukasz Staniszewski, we show how: 🧵 1/

thumb_up_off_alt127

chat_bubble_outline2

repeat27

shareShare

Bartosz Cywiński

@bartoszcyw

2 months ago

New paper: Deceptive LLMs may keep secrets from their operators. Can we elicit this latent knowledge? Maybe! Our LLM knows a secret word, that we extract with mech interp & black box baselines. We open source our model, how much better can you do? w/Emil Ryd Senthooran Rajamanoharan Neel Nanda

thumb_up_off_alt113

chat_bubble_outline2

repeat18

shareShare