ZAYA1-8B: An 8B MoE Model with 760M Active Params Matching DeepSeek-R1 on Math

(firethering.com) by steveharing1 | May 7, 2026 | 0 comments on HN