Beyond 80/20: High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning

(arxiv.org) by mdp2021 | Apr 30, 2026 | 0 comments on HN