ZAYA1-8B: An 8B MoE Model with 760M Active Params Matching DeepSeek-R1 on Math (firethering.com) by steveharing1 | May 7, 2026 | 0 comments on HN