Show HN: AST-guard A gradient-immune structural guard against RL reward hacking

(github.com)

3 points | by thinking-nick 7 hours ago ago

1 comments