Combining LLMs Rarely Beats the Best Single Model, I tested 67 frontier models

(arxiv.org)

1 points | by josefchen 6 hours ago ago

No comments yet.