Login

Cassandra: Enabling Reasoning LLMs at Edge via Self-Speculative Decoding

(arxiv.org) by chrsw | May 29, 2026 | 0 comments on HN
Visit Link
← Back to news