R-Zero: Self-Evolving Reasoning LLM from Zero Data

(arxiv.org)

121 points | by lawrenceyan 4 days ago

63 comments