▲ 1 Safety benchmarks are inflated because models know they're being tested (lesswrong.chttps:) by aranguri | May 4, 2026 | 0 comments on HN Visit Link