▲ 1 SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving (arxiv.org) by matt_d | Apr 22, 2026 | 0 comments on HN Visit Link