OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)

⚓ IT    📅 2026-01-29    👤 surdeus    👁️ 1      

surdeus

Comments 🏷️ IT_feed