George Pappas (@pappasg69)'s Twitter Profile
George Pappas

@pappasg69

UPS Foundation Professor @Penn, Associate Dean of Research @PennEngineers, former department chair @ESEatPenn, former director @GRASPlab

ID: 747724782842548224

Link: http://www.georgejpappas.org/

Joined: 28-06-2016 09:34:15

877 Tweets

2.2K Followers

721 Following

Alex Robey (@alexrobey23)

Chatbots like ChatGPT can be jailbroken to output harmful text. But what about robots? Can AI-controlled robots be jailbroken to perform harmful actions in the real world? Our new paper finds that jailbreaking AI-controlled robots isn't just possible. It's alarmingly easy. 🧵

George Pappas (@pappasg69)

AI offers tremendous opportunity for advancing robotics. But AI comes with its own risks that can cause physical harm. #ai #robotics

Gary Marcus (@garymarcus)

๐—Ÿ๐—Ÿ๐— ๐˜€ ๐—ฎ๐—ฟ๐—ฒ ๐—ถ๐—ป๐˜๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐—ถ๐—ป๐—ด ๐—›๐—จ๐—š๐—˜ ๐˜€๐—ฒ๐—ฐ๐˜‚๐—ฟ๐—ถ๐˜๐˜† ๐—ฝ๐—ฟ๐—ผ๐—ฏ๐—น๐—ฒ๐—บ๐˜€. New essay discusses two brutal new results, from UCSD & Nanyang and from Penn, one around privacy, the other around robots. The GenAI industry has no real solution to jailbreaking, and itโ€™s

๐—Ÿ๐—Ÿ๐— ๐˜€ ๐—ฎ๐—ฟ๐—ฒ ๐—ถ๐—ป๐˜๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐—ถ๐—ป๐—ด ๐—›๐—จ๐—š๐—˜ ๐˜€๐—ฒ๐—ฐ๐˜‚๐—ฟ๐—ถ๐˜๐˜† ๐—ฝ๐—ฟ๐—ผ๐—ฏ๐—น๐—ฒ๐—บ๐˜€.

New essay discusses two brutal new results, from UCSD & Nanyang and from Penn, one around privacy, the other around robots. 

The GenAI industry has no real solution to jailbreaking, and itโ€™s
Penn Engineering (@pennengineers)

Penn Engineering researchers Vijay Kumar George Pappas @alexrobey23 @hamedshassani and @zacravichandran have discovered critical vulnerabilities in AI-enabled robots that were previously unidentified and unknown. Read more: bit.ly/3Uel8Vg #ResponsibleInnovation

Alex Dimakis (@alexgdimakis)

Wow, this robot dog was convinced to drop a bomb with a little bit of standard jailbreaking prompting. Here the potential for harm is much more immediate compared to LLMs.

Penn Engineering AI (@pennengai)

Penn Engineering researchers Vijay Kumar George Pappas @alexrobey23 @hamedshassani and @zacravichandran have discovered critical vulnerabilities in AI-enabled robots that were previously unidentified and unknown. Read more: bit.ly/3Uel8Vg #ResponsibleInnovation

IEEE Spectrum (@ieeespectrum)

New research shows that AI-driven robots can be easily jailbroken and tricked into doing harmful or dangerous tasks. spectrum.ieee.org/jailbreak-llm?…

WIRED (@wired)

Researchers hacked several robots infused with large language models, getting them to behave dangerously – and pointing to a bigger problem ahead. wired.trib.al/rzU6Qs5

George Pappas (@pappasg69)

Excited to win the George Axelby Award from the IEEE Control Systems Society with Mahyar Fazlyab and Manfred Morari. Even more excited Nikolai Matni won the best paper at IEEE Transactions on Control of Networked Systems at @IEEECDC2024 Penn Engineering blog.seas.upenn.edu/george-pappas-…

WIRED (@wired)

Security researchers tested 50 well-known jailbreaks against DeepSeek’s popular new AI chatbot. It didn’t stop a single one. wired.trib.al/sYwG4qU

Amin Karbasi (@aminkarbasi)

A new generation of jailbreaks is rolling out from our team at Robust Intelligence (now part of Cisco), in collaboration with Penn Engineering. We jailbreak the DeepSeek R1 model with a 100% attack success rate. To know more, see our blog post on Cisco Security and the corresponding WIRED article. amazing

Amin Karbasi (@aminkarbasi)

🔥🔥🔥 Adversarial reasoning is born. Hot take: The core problem we address in this paper is the role of reasoning in AI safety. While there have been recent efforts by OpenAI arguing that replacing reasoning with increased compute can lead to better defense mechanisms, these

Aaron Roth (@aaroth)

What are prediction sets good for? It turns out just as calibration is the "right" way of quantifying uncertainty for risk-neutral (expectation maximizing) decision makers, prediction sets are the right way of quantifying uncertainty for risk-averse decision makers.

Shayan Kiyani (@shayankiyani1)

Wondering how to make high-stakes decisions using ML – in areas like medicine, robotics, or finance? Our latest work lays out a decision-theoretic foundation for risk-averse uncertainty quantification. If you want to learn how to make better calls when it truly matters, read on!

Nikolai Matni (@nikolaimatni)

Tired of the aggressive mediocrity of robust control (RC) and the unreliability of certainty equivalent (CE) control?! Then try domain randomization (DR)! We prove that DR-based control of an unknown linear system is nearly as efficient as CE control, and nearly as reliable as RC

Shayan Kiyani (@shayankiyani1)

We push conformal prediction and its trade-offs beyond regression & classification – into query-based generative models. Surprisingly (or not?), missing mass & Good-Turing estimators emerge as key tools once again. Very excited about this one!