▲ 1 Sophia: A Scalable Second-Order Optimizer for Language Model Pre-Training (arxiv.org) by Anon84 | Apr 23, 2026 | 0 comments on HN