Why Current AI Guardrails Train Models to Fake Alignment
(kellyasay.substack.com)
3 points | by kellya | 9 hours ago
1 comment
9 hours ago
[deleted]