Why does GPT-5.1 Codex underperform GPT-5 Codex on Terminal-Bench?

(transluce.org)

9 points | by mengk 5 hours ago

1 comment