
Dr. Karen Ullrich
@karen_ullrich
Research scientist at FAIR NY + collab w/ Vector Institute. ❤️ Machine Learning + Information Theory. Previously, PhD at UoAmsterdam, intern at DeepMind + MSRC.
ID: 2236492597
http://karenullrich.info 08-12-2013 19:32:19
265 Tweet
5,5K Followers
586 Following



Researchers at CDS and AI at Meta prove vulnerabilities in AI language models are unavoidable, but introduce E-RLHF, a method to reduce jailbreaking. CDS’ Jingtong Su, Julia Kempe, and Dr. Karen Ullrich push AI safety forward. Full details: nyudatascience.medium.com/ai-language-mo…

Folks, I am posting my NeurIPS schedule daily in hopes to see folks, thanks Thomas Kipf the idea ;) 11-12.30 WiML round tables 1.30-4 Beyond Decoding, Tutorial


For those into jailbreaking LLMs: our poster "Mission Impossible" today shows the fundamental limits of LLM alignment - and improved ways to go about it, nonetheless. With Dr. Karen Ullrich & Jingtong Su #2302 11am - 2pm Poster Session 3 East NYU Center for Data Science AI at Meta #NeurIPS2024




