DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

(github.com)

343 points | by aurenvale 3 hours ago

87 comments