Login

Fastest LLM decode engine on Apple Silicon. 658 tok/s on M4-Max,beats MLX by 19%

(runanywhere.ai) by sanchitmonga | Mar 7, 2026 | 1 comments on HN
Visit Link
← Back to news