▲ 1 Confidence estimation is a better metric than agreement for LLM judges (arxiv.org) by rapiddev | Jun 23, 2026 | 0 comments on HN Visit Link