▲ 105 OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%) (quesma.com) by stared | Jan 29, 2026 | 56 comments on HN