Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

(dao-lab.ai)

2 points | by matt_d 10 hours ago ago

No comments yet.