llm-response-caching-layer - Skill Dossier

Implements exact-match and semantic caching for LLM responses, cutting API costs by roughly 40-60% and reducing latency. Activate on: LLM caching, semantic cache, reduce API costs, cache AI responses. NOT for: general web caching (caching-strategies) or CDN configuration (cloudflare-worker-dev).
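The two lookup modes the dossier names can be sketched as a single cache object: an exact-match tier keyed by a hash of the normalized prompt, with a semantic tier that falls back to embedding similarity above a threshold. This is a minimal illustrative sketch, not the skill's actual implementation; the `embed` callable, the `0.92` threshold, and the toy bag-of-letters embedding used in the demo are all assumptions standing in for a real embedding model and a persistent store such as Redis.

```python
import hashlib
import math


def _exact_key(prompt: str) -> str:
    # Normalize case and whitespace before hashing, so trivially
    # different prompts map to the same exact-match key.
    normalized = " ".join(prompt.lower().split())
    return hashlib.sha256(normalized.encode()).hexdigest()


def _cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


class LLMCache:
    """Exact-match lookup first, then semantic (embedding-similarity) fallback."""

    def __init__(self, embed, threshold: float = 0.92):
        self.embed = embed          # embedding function (assumed provided by caller)
        self.threshold = threshold  # minimum cosine similarity for a semantic hit
        self.exact = {}             # sha256(normalized prompt) -> response
        self.semantic = []          # list of (embedding, response) pairs

    def get(self, prompt: str):
        # Tier 1: exact match on the normalized prompt.
        key = _exact_key(prompt)
        if key in self.exact:
            return self.exact[key]
        # Tier 2: linear scan for the most similar cached prompt.
        # (A real system would use a vector index instead of a scan.)
        vec = self.embed(prompt)
        best, best_sim = None, 0.0
        for stored_vec, response in self.semantic:
            sim = _cosine(vec, stored_vec)
            if sim > best_sim:
                best, best_sim = response, sim
        return best if best_sim >= self.threshold else None

    def put(self, prompt: str, response: str):
        self.exact[_exact_key(prompt)] = response
        self.semantic.append((self.embed(prompt), response))


# Demo with a toy bag-of-letters "embedding" (a stand-in for a real model).
def toy_embed(text):
    vec = [0.0] * 26
    for ch in text.lower():
        if ch.isalpha():
            vec[ord(ch) - 97] += 1.0
    return vec


cache = LLMCache(toy_embed, threshold=0.95)
cache.put("What is the capital of France?", "Paris")
```

After `put`, a re-asked prompt that differs only in case or spacing hits the exact tier, while a slightly reworded prompt can still hit the semantic tier if its embedding clears the threshold; anything dissimilar returns `None` and would fall through to a live API call.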
Backend & Infrastructure
#caching #llm #cost-reduction #semantic-cache #redis
Allowed Tools
Read, Write, Edit, Bash(python:*, pip:*, npm:*, npx:*)
Coming in Spring 2026 Beta
WinDAGs will match this skill automatically. Then ask:
"Use llm-response-caching-layer to help me build..."