Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs
📅 2025-08-07 ⚓ Hacker News 🌐 Source 🖼️ Load Image