OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)

📅 2026-01-29    ⚓ Hacker News    🌐 Source    🖼️ Load Image