Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation

(arxiv.org)

43 points | by PaulHoule a day ago ago

3 comments