Ivaxi Sheth (@ivakshi_s)'s Twitter Profile
Ivaxi Sheth

@ivakshi_s

PhD student @CISPA | Prev @Mila_Quebec @imperialcollege'20

Causality / LLMs / Safety

ID: 1335517450654412800

Link: http://ivaxi0s.github.io

Joined: 06-12-2020 09:32:37

111 Tweets

386 Followers

852 Following

Jan Wehner (@janwehner436164)

💥Representation Engineering is a new approach for controlling LLM behavior that has been exploding in popularity! 📜Our survey "Taxonomy, Opportunities, and Challenges of Representation Engineering for Large Language Models" spans 130 papers to answer all your questions about it

Anurag Singh (@_anurags14)

🚨 New Preprint Alert! 🚨 Are you interested in Imprecise Probability (IP)? Then check out our latest preprint "Truthful Elicitation of Imprecise Forecasts". Joint work with Siu Lun Chau and Krikamol (Hiring Postdoc). arxiv.org/abs/2503.16395 A quick thread🧵(1/3)

Ivaxi Sheth (@ivakshi_s)

I am excited to present our work on safety for open-ended AI systems at the FAR.AI Alignment workshop tomorrow and SSI-FM workshop on 27th #ICLR25 🇸🇬 If you are also thinking about similar challenges or just curious, would love to chat!

Ivaxi Sheth (@ivakshi_s)

Inspiring keynote by Tim Rocktäschel at ICLR on Open-endedness and automation of AI 🚀 However I hope that the unpredictable and uncontrollable nature of open-ended AI also inspires strong focus on Safety 👷‍♀️🦺👩‍🔬 x.com/sahar_abdelnab…

Sahar Abdelnabi 🕊 (on 🦋) (@sahar_abdelnabi)

📢📢Our paper, "A Theory of Response Sampling in LLMs: Part Descriptive and Part Prescriptive" has been accepted at ACL 2025 (Main Conference)!! 🥳 Details below 👇🧵1/n

Sahar Abdelnabi 🕊 (on 🦋) (@sahar_abdelnabi)

Hawthorne effect describes how study participants modify their behavior if they know they are being observed. In our paper 📢, we study if LLMs exhibit analogous patterns🧠 Spoiler: they do⚠️ 🧵1/n
