Igor Shilov (@_igorshilov) 's Twitter Profile
Igor Shilov

@_igorshilov

Anthropic AI Safety Fellow

PhD student at @imperialcollege.
ML, interpretability, privacy, and stuff
🏳️‍🌈

ID: 274134439

linkhttps://igorshilov.com/ calendar_today29-03-2011 20:03:53

6,6K Tweet

1,1K Followers

355 Following

Igor Shilov (@_igorshilov) 's Twitter Profile Photo

Trying out o3 on Bellingcat's OSINT challenges is so fun. It's clearly close to being good at these so it's worth a shot, but not quite there yet to do it reliably. challenge.bellingcat.com It nailed an aerial photo question in just 37 seconds, but struggling with most others

Ilia Shumailov🦔 (@iliaishacked) 's Twitter Profile Photo

Are modern large language models (LLMs) vulnerable to privacy attacks that can determine if given data was used for training? Models and dataset are quite large, what should we even expect? Our new paper looks into this exact question. 🧵 (1/10)

Are modern large language models (LLMs) vulnerable to privacy attacks that can determine if given data was used for training? Models and dataset are quite large, what should we even expect? Our new paper looks into this exact question. 🧵 (1/10)