▲ 1 Skipping 90% of KV dequant work speeds up LLM decode by 22% (github.com) by pidtom | Mar 27, 2026 | 1 comments on HN Visit Link