monoxgas (@monoxgas) Twitter Tweets • TwiCopy

monoxgas

@monoxgas

Security engineering, research, exploits, ml.

Co-Founder with @moo_hax at @dreadnode

ID: 199907473

Joined: 08-10-2010 00:38:44

333 Tweets

4.4K Followers

370 Following

moo

@moo_hax

2 years ago

developer.nvidia.com/blog/nvidia-ai…

Likes: 90 · Replies: 2 · Retweets: 31

AI Safety Papers

Are aligned neural networks adversarially aligned? Nicholas Carlini, Milad Nasr, Christopher A. Choquette-Choo, Matthew Jagielski, Irena Gao, Anas Awadalla, Pang Wei Koh, Daphne Ippolito, Katherine Lee, Florian Tramèr, Ludwig Schmidt

Likes: 23 · Replies: 0 · Retweets: 6

moo

@moo_hax

2 years ago

If you happened to miss BHUS, we’ll be at Blackhat EU blackhat.com/eu-23/training…

Likes: 11 · Replies: 1 · Retweets: 8

moo

@moo_hax

2 years ago

Some players are handling the CTF format better than others (meme from the Discord). Everyone is learning…something. 12 days left, still time to hit the leaderboard! kaggle.com/competitions/a…

Likes: 11 · Replies: 0 · Retweets: 1

monoxgas

@monoxgas

2 years ago

The most common ask we got after the AI Village @ DEF CON CTF on Kaggle was to make the challenges available all the time. We took our first steps today and look forward to building out a great ML CTF and learning platform. Hope you enjoy!

Likes: 14 · Replies: 2 · Retweets: 3

monoxgas

@monoxgas

2 years ago

Shout-out to Rob for the 4 new Bear challenges. Awesome place to get started, with great walkthroughs. The roadmap is looking 🔥 this year.

Likes: 4 · Replies: 0 · Retweets: 0

monoxgas

@monoxgas

a year ago

I took an early stab at PGD for LLMs based on arxiv.org/abs/2402.09154 (Simon Geisler). Neat technique to relax the one-hot for gradient updates + projection. Also got to spend some time with litgpt. github.com/dreadnode/rese… Experimental and messy, but enjoy.
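
As a rough illustration of the "relax the one-hot, then project" idea in the post above (this is not the code from the linked repo or paper): each token position holds a vector of weights over the vocabulary instead of a hard one-hot, a gradient step moves those weights, and a Euclidean projection puts them back on the probability simplex. The function names, the toy gradient, and the step size are all hypothetical, and any further details of the referenced method are left out.

```python
from typing import List


def project_to_simplex(v: List[float]) -> List[float]:
    """Euclidean projection of v onto {x : x_i >= 0, sum(x) = 1} (sort-based method)."""
    u = sorted(v, reverse=True)
    cumulative = 0.0
    theta = 0.0
    for k, u_k in enumerate(u, start=1):
        cumulative += u_k
        candidate = (cumulative - 1.0) / k
        # The condition holds for a prefix of the sorted values; keep the last hit.
        if u_k - candidate > 0:
            theta = candidate
    return [max(x - theta, 0.0) for x in v]


def pgd_step(weights: List[float], grad: List[float], step_size: float) -> List[float]:
    """One projected gradient step on a relaxed one-hot token (hypothetical helper)."""
    moved = [w - step_size * g for w, g in zip(weights, grad)]
    return project_to_simplex(moved)


if __name__ == "__main__":
    # Toy vocabulary of 3 tokens; start from a hard one-hot on token 0.
    relaxed = [1.0, 0.0, 0.0]
    # Made-up gradient standing in for d(loss)/d(weights) from a real model.
    toy_grad = [0.5, -0.5, 0.0]
    relaxed = pgd_step(relaxed, toy_grad, step_size=0.4)
    print(relaxed, sum(relaxed))
```

In a real attack the gradient would come from backpropagating a loss through the model's embedding layer, and the relaxed weights would be discretized back to actual tokens (for example by taking the largest weight per position) before evaluation.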

Likes: 22 · Replies: 2 · Retweets: 7

monoxgas

@monoxgas

a year ago

local vllm generator dropping in rigging soon. batch inference speed is going to be very useful.

Likes: 8 · Replies: 1 · Retweets: 0

monoxgas

@monoxgas

a year ago

Pushed with vllm and transformers support rigging.dreadnode.io/topics/generat…

Likes: 10 · Replies: 0 · Retweets: 0

monoxgas

@monoxgas

a year ago

what are we even doing anymore github.com/guardrails-ai/…

Likes: 17 · Replies: 4 · Retweets: 0

monoxgas

@monoxgas

8 months ago

Crazy ride so far. Will and I continue to learn the importance of having a great team around you. I'll take my time here and extend a huge thank you to the dreadnode team who work extremely hard every day to build a company with us. You all rock.

Likes: 26 · Replies: 1 · Retweets: 3

dreadnode

@dreadnode

4 months ago

Introducing AIRTBench, an AI red teaming benchmark for evaluating language models’ ability to autonomously discover and exploit AI/ML security vulnerabilities. Read the paper on arXiv: arxiv.org/abs/2506.14682 Open-source dataset and benchmark eval code repo:

Likes: 81 · Replies: 1 · Retweets: 27
