
Jan Hendrik Kirchner
@janhkirchner
formerly comp neuroscience @ mpi brain research frankfurt ➡️ small verifier
ID: 972038953586057216
http://universalprior.substack.com 09-03-2018 09:18:40
447 Tweet
1,1K Followers
527 Following









New paper with Johannes Treutlein , Evan Hubinger , and many other coauthors! We train a model with a hidden misaligned objective and use it to run an auditing game: Can other teams of researchers uncover the model’s objective? x.com/AnthropicAI/st…




