We reproduced Anthropic's Mythos findings with public models

(blog.vidocsecurity.com)

92 points | by __natty__ 3 hours ago ago

48 comments