vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep

📅 2026-01-13    ⚓ Hacker News    🌐 Source    🖼️ Load Image