▲ 1 Beyond 80/20: High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning (arxiv.org) by mdp2021 | Apr 30, 2026 | 0 comments on HN Visit Link