Tu Trinh (@thetututrain) Twitter Tweets • TwiCopy

Tu Trinh

@thetututrain

Aka Alina Trinh. ML research engineer @scale_AI | EECS MS @UCBerkeley @CHAI_Berkeley @berkeley_ai

ID: 1746648602183905280

14-01-2024 21:41:24

3 Tweets

37 Followers

124 Following

Tu Trinh

@thetututrain

2 years ago

How can a robot self-assess when it has received enough demonstrations to perform a task correctly? Excited to present our work at #HRI2024 during Tuesday's session on Learning! Paper arxiv.org/abs/2211.15542 w/ Haoyu Chen and Daniel Brown

3 Likes

0 Replies

1 Retweet

Tu Trinh

@thetututrain

4 months ago

Cooking up cool stuff at work 🍜🤖 had a great time building model debate for data quality!

17 Likes

3 Replies

1 Retweet

Sonia

@soniajoseph_

2 months ago

Our paper "From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers" was accepted into NeurIPS! @NeurIPSConf

We show that SAEs _are_ indeed useful for safety applications!

SAEs can reliably detect, and meaningfully suppress, hallucinations.

274 Likes

5 Replies

24 Retweets