Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs

📅 2025-08-07    ⚓ Hacker News    🌐 Source    🖼️ Load Image