
jasmine
@j_asminewang
control empirics lead @AISecurityInst. cofounded @verses_xyz @kernel_magazine @readtrellis @copysmith_ai
ID: 1295193837258727424
https://jasminew.me 17-08-2020 03:01:00
2,2K Tweet
6,6K Followers
1,1K Following







Cool to see folks from many parts of the AI safety ecosystem unite around this. We should study what makes models monitorable and track monitorability in system cards. Bravo to everyone involved, and thank you especially to Tomek Korbak and Mikita Balesni 🇺🇦 for leading this work!





Mikita Balesni 🇺🇦 If someone works out how to trade away this transparency in exchange for more efficiency and ushers in a new era of opaque thoughts, they may have done more than any other individual to lower the chance humanity survives this century.
