Intellect-2: A Reasoning Model Trained Through Globally Decentralized RL

(arxiv.org)

1 points | by nkko 5 hours ago ago

No comments yet.