FineWeb2: Adapting Pre-Training Data Processing to Every Language

(arxiv.org)

7 points | by hynky 2 days ago ago

No comments yet.