Why Current AI Guardrails Train Models to Fake Alignment

(kellyasay.substack.com)

3 points | by kellya 9 hours ago

1 comment