KV Cache Compression 900000x Beyond TurboQuant and Per-Vector Shannon Limit

(arxiv.org)

43 points | by EGreg 3 hours ago ago

36 comments