Clement Neo @ ICLR 25 🇸🇬 (@_clementneo)'s Twitter Profile
Clement Neo @ ICLR 25 🇸🇬

@_clementneo

Mechanistic interpretability @ SG AISI also with Apart Research

ID: 1367720335244599302

http://clementneo.com · 05-03-2021 06:19:15

397 Tweets

374 Followers

260 Following

Clement Neo @ ICLR 25 🇸🇬 (@_clementneo)'s Twitter Profile Photo

I will be presenting my first ever poster at EMNLP 2024 from 10:30am-12pm today in the Jasmine room! I think I have a really nice poster so come check it out if you’re around :)

This reminds me of the phenomenon I think I saw (but can’t find/verify) where Claude was somewhat aware that its response got pre-filled and had a similar disbelief to the earlier part of its response. Does anyone know what I’m referring to?

This is going to be an interesting social, and the team organizing this are super cool! The public sector is often a good indicator of conservative LLM adopters worried about safety guarantees, and I’ve learned a ton from these folks. Definitely do attend if you’re free.