Login

OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)

(quesma.com) by stared | Jan 29, 2026 | 56 comments on HN
Visit Link
← Back to news