Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x

(arstechnica.com)

16 points | by gmays a day ago

2 comments

redanddead a day ago

You'd think it'd be bigger news on HN.
- axiologist a day ago
  
  See https://news.ycombinator.com/item?id=47513475 from two days ago.