Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents

(thinkwright.ai)

1 points | by oceanwaves 10 hours ago ago

1 comments