Yash Maurya (@yash_maurya01) 's Twitter Profile
Yash Maurya

@yash_maurya01

MS Privacy Engineering @SCSatCMU

Federated Learning | Explainable & Responsible AI | Differential Privacy

ID: 217049073

linkhttp://yashmaurya.com calendar_today18-11-2010 12:58:41

7 Tweet

63 Followers

610 Following

Aman Priyanshu (@amanpriyanshu6) 's Twitter Profile Photo

1/ 🌐Google AI's BARD just dropped a bombshell with its latest YouTube extension feature! But with great power comes great responsibility. Digging deeper into the potential risks and vulnerabilities this update brings. #ai #infosecurity #Security

1/ 🌐<a href="/GoogleAI/">Google AI</a>'s BARD just dropped a bombshell with its latest YouTube extension feature! But with great power comes great responsibility. Digging deeper into the potential risks and vulnerabilities this update brings. #ai #infosecurity #Security
Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

I tried 14 of the multimodal reasoning examples from the Google DeepMind Gemini paper on OpenAI's chatGPT-4 (with vision). didn't even transcribe the prompts, I just pasted the images of prompts. GPT-4 gets ~12/14 right. 14 part boring thread.

I tried 14 of the multimodal reasoning  examples from the <a href="/GoogleDeepMind/">Google DeepMind</a>  Gemini paper on <a href="/OpenAI/">OpenAI</a>'s  chatGPT-4 (with vision). didn't even transcribe the prompts, I just pasted the images of prompts. 

GPT-4 gets ~12/14 right.

14 part boring thread.
Franklin Graves 🚀 (@franklingraves) 's Twitter Profile Photo

Fun fact I learned today: Meta tried to send GitHub a DMCA takedown when a user uploaded the weights associated with LLaMA. 🦙 How do you think it turned out?

Fun fact I learned today: Meta tried to send GitHub a DMCA takedown when a user uploaded the weights associated with LLaMA. 🦙 

How do you think it turned out?
Aman Priyanshu (@amanpriyanshu6) 's Twitter Profile Photo

I'm speaking at #pepr24 along with my co-authors Yash Maurya and Vy Tran in Santa Clara, June 3–4, 2024. Discover how LLMs challenge privacy tech! Join me! bit.ly/pepr2024

I'm speaking at #pepr24 along with my co-authors <a href="/yash_maurya01/">Yash Maurya</a> and Vy Tran in Santa Clara, June 3–4, 2024. Discover how LLMs challenge privacy tech! Join me! bit.ly/pepr2024
Aman Priyanshu (@amanpriyanshu6) 's Twitter Profile Photo

3/ Inspired by the question, I decided to investigate the 'enum' keyword allowed in function-calling. And guess what? I successfully cracked it! Here's a glimpse of how the jail-break works:

3/ Inspired by the question, I decided to investigate the 'enum' keyword allowed in function-calling. 
And guess what? I successfully cracked it! 
Here's a glimpse of how the jail-break works:
Pratiksha Thaker (@prthaker_) 's Twitter Profile Photo

🚨 Are you using empirical benchmarks to evaluate your LLM unlearning method? Our new paper arxiv.org/pdf/2410.02879 investigates how success on these benchmarks can be misleading. A🧵: 1/n

Mehul Damani @ ICLR (@mehuldamani2) 's Twitter Profile Photo

🚨New Paper!🚨 We trained reasoning LLMs to reason about what they don't know. o1-style reasoning training improves accuracy but produces overconfident models that hallucinate more. Meet RLCR: a simple RL method that trains LLMs to reason and reflect on their uncertainty --

🚨New Paper!🚨
We trained reasoning LLMs to reason about what they don't know.

o1-style reasoning training improves accuracy but produces overconfident models that hallucinate more.

Meet RLCR: a simple RL method that trains LLMs to reason and reflect on their uncertainty --