Surpassing vLLM with a Generated Inference Stack

(infinity.inc)

24 points | by lukebechtel 6 hours ago ago

6 comments