SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI

(arxiv.org)

49 points | by mpweiher 3 hours ago ago

3 comments