2 concepts, live from D1
Concepts
Every real context-management concept catalogued here, grouped by category unless filtered.
| concept | description |
|---|---|
| What prompt caching is | Prompt caching lets a client mark a prefix of a prompt (e.g. a long system prompt, a set of tool definitions, or a large shared document) as cacheable, so that repeated requests reusing that same prefix are billed and processed more cheaply and with lower latency than reprocessing it from scratch each time. |
| When prompt caching is most useful | Prompt caching pays off most when the same large prefix (system prompt, tool schema, long reference document, or few-shot examples) is reused across many requests in a short time window, such as a multi-turn agent session or a batch of requests over the same document. |
Machine-readable version: GET /api/context-concepts?category=prompt_caching