Fastest LLM decode engine on Apple Silicon. 658 tok/s on M4-Max, beats MLX by 19%

(runanywhere.ai) by sanchitmonga | Mar 7, 2026 | 1 comment on HN
