▲
33
We architected an edge caching layer to eliminate cold starts
(mintlify.com)
by skeptrune |
view
|
23 comments
▲
11
GNOME GitLab Git traffic caching
(dragonsreach.it)
by JNRowe |
view
|
0 comments
▲
6
How Prompt Caching Works – Paged Attention and Automatic Prefix Caching
(sankalp.bearblog.dev)
by mji |
view
|
0 comments
▲
5
Tested OpenAI's prompt caching across models. Found undocumented behavior
by harsharanga |
view
|
0 comments
▲
4
Show HN: Agent-cache – Multi-tier LLM/tool/session caching for Valkey and Redis
by kaliades |
view
|
0 comments
▲
3
PostgreSQL Materialized Views: When Caching Your Query Results Makes Sense
(stormatics.tech)
by ioololaa |
view
|
0 comments
▲
3
Kv.js: Advanced in-memory caching for JavaScript
(npmjs.com)
by ent101 |
view
|
0 comments
▲
3
GitHub Actions broke caching on macOS
(github.com)
by twp |
view
|
0 comments
▲
3
Query Plan Caching
(buttondown.com)
by ibobev |
view
|
0 comments
▲
2
Lessons from Building Claude Code: Prompt Caching Is Everything
(twitter.com)
by mfiguiere |
view
|
0 comments
▲
2
Caching is better than mocking
(federicopereiro.com)
by todsacerdoti |
view
|
0 comments
▲
2
Rails update: per-adapter migration, hash-format support, MemoryStore caching
(rubyonrails.org)
by andrewstetsenko |
view
|
0 comments
▲
2
Coolify accidentally broke Docker layer caching (and what you can do now)
(loopwerk.io)
by kjmr |
view
|
0 comments
▲
2
Show HN: Add semantic caching to LLM APIs with one-line-of-code
(kentocloud.com)
by andreysheva |
view
|
1 comment
▲
2
Query Plan Caching
(buttondown.com)
by vinhnx |
view
|
0 comments
▲
2
Show HN: A pragmatic SQLite schema for application-level caching
(gist.github.com)
by ebenes |
view
|
0 comments
▲
1
FP8 Search and KV-Caching in USearch
(unum.cloud)
by ashvardanian |
view
|
0 comments
▲
1
Show HN: Seamless – Content-addressed computation caching for Python and bash
(github.com)
by sjdv1982 |
view
|
0 comments
▲
1
Caching Strategies from Scratch
(vaibhavacharya.github.io)
by vaibhavacharya_ |
view
|
0 comments
▲
1
The Complete Guide to Inference Caching in LLMs
(machinelearningmastery.com)
by eigenBasis |
view
|
0 comments
▲
1
Interval-Aware Caching for Druid at Netflix Scale
(netflixtechblog.com)
by wb14123 |
view
|
0 comments
▲
1
AI agent with semantic caching and local embeddings, one runtime
(github.com)
by sg-hdb |
view
|
0 comments
▲
1
Prompt Caching from First Principles, blog with an AI co-author
(lossfn.com)
by rcdexta |
view
|
1 comment
▲
1
We cut our agent's API costs by 10x with prompt caching
(kern-ai.com)
by obilgic |
view
|
0 comments
▲
1
Alan Cache – the best caching library?
(medium.com)
by damsieboy |
view
|
0 comments
▲
1
Prefix caching for LLM inference optimization
(bentoml.com)
by eigenBasis |
view
|
0 comments
▲
1
QuantumLeap: 2.3× faster MoE inference with intelligent expert caching
(github.com)
by ikharoz |
view
|
0 comments
▲
1
Incident March 30th, 2026 – Accidental CDN Caching
(blog.railway.com)
by cebert |
view
|
0 comments
▲
1
Railway CDN Caching Incident: When Opt-In Becomes Opt-Everyone-In
(joshuabellew.com)
by kawsper |
view
|
0 comments
▲
1
Caching algorithms without knowing how they work
(blog.autorouting.com)
by juanpabloaj |
view
|
0 comments
▲
1
Show HN: AI Cost Firewall – OpenAI-compatible gateway with semantic caching
(github.com)
by vcaluser |
view
|
0 comments
▲
1
Alan Cache – the best caching library? (Part 1)
(medium.com)
by damsieboy |
view
|
0 comments
▲
1
AI Optimizer – OpenAI API Caching Proxy (20-40% Cost Savings)
(github.com)
by adamday75 |
view
|
0 comments
▲
1
Show HN: Agent Caching in Fiddler
(telerik.com)
by zlatkov |
view
|
0 comments
▲
1
Hedystia DB – A type-safe ORM for TypeScript with smart caching
(docs.hedystia.com)
by Zastinian |
view
|
0 comments
▲
1
Prompt-caching – auto-injects Anthropic cache breakpoints (90% token savings)
(prompt-caching.ai)
by ermis |
view
|
1 comment
▲
1
Type-Safe Caching
(encore.dev)
by andout_ |
view
|
0 comments
▲
1
Deep Dive on Prompt Caching
(claudecodecamp.com)
by begemotz |
view
|
0 comments
▲
1
Show HN: Nexus Gateway – Reduce LLM API Costs Using Semantic Caching
(nexus-gateway.org)
by Sunnyanand_dev |
view
|
0 comments
▲
1
How prompt caching works in Claude Code: experiments and architectural lessons
(claudecodecamp.com)
by aray07 |
view
|
0 comments
▲
1
Show HN: I built a serverless wrapper for the EU VIES with caching and webhooks
(vatflow.net)
by QuiCreatDev |
view
|
1 comment
▲
1
vLLM-mlx – 65 tok/s LLM inference on Mac with tool calling and prompt caching
(github.com)
by raullen |
view
|
1 comment
▲
1
Prompt Caching 201
(developers.openai.com)
by tosh |
view
|
0 comments
▲
1
Lessons from Building Claude Code: Prompt Caching Is Everything
(twitter.com)
by tosh |
view
|
0 comments
▲
1
Show HN: OMLX – MLX inference server with paged SSD KV caching for Apple Silicon
(github.com)
by jundot |
view
|
0 comments
▲
1
Prompt Caching 201
(developers.openai.com)
by gmays |
view
|
0 comments
▲
1
Show HN: callonce-go – singleflight and per-request caching for Go services
(github.com)
by probablyarth |
view
|
0 comments
▲
1
Show HN: Omni Cache – friendly sidecar for CI caching needs
(github.com)
by fkorotkov |
view
|
0 comments
▲
1
Show HN: Nexus Gateway – A self-healing AI gateway in Go with 5ms caching
(nexus-gateway.org)
by Sunnyanand_dev |
view
|
0 comments
▲
1
Linux 7.0 Aims to Replace More Caching Code with Sheaves
(phoronix.com)
by rbanffy |
view
|
0 comments