Two different tricks for fast LLM inference

(seangoedecke.com)

38 points | by swah 3 hours ago ago

19 comments