Dispersion loss counteracts embedding condensation in small language models

(chenliu-1996.github.io)

20 points | by E-Reverance 2 hours ago

4 comments