Prompt Caching Middleware
Optimizes API usage by caching conversation prefixes for Anthropic models.
Requires both langchain and langchain-anthropic packages to be installed.
Learn more in Anthropic's prompt caching documentation.
```python
AnthropicPromptCachingMiddleware(
    type: Literal['ephemeral'] = 'ephemeral',
    ttl: Literal['5m', '1h'] = '5m',
    min_messages_to_cache: int = 0,
    unsupported_model_behavior: Literal['ignore', 'warn', 'raise'] = 'warn',
)
```

| Name | Type | Description |
|---|---|---|
| `type` | `Literal['ephemeral']` | The type of cache to use; only `'ephemeral'` is supported. Default: `'ephemeral'` |
| `ttl` | `Literal['5m', '1h']` | The time to live for the cache; only `'5m'` and `'1h'` are supported. Default: `'5m'` |
| `min_messages_to_cache` | `int` | The minimum number of messages in the conversation before caching is applied. Default: `0` |
| `unsupported_model_behavior` | `Literal['ignore', 'warn', 'raise']` | The behavior when a non-Anthropic model is used with this middleware. Default: `'warn'` |
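Conceptually, Anthropic prompt caching works by attaching a `cache_control` marker to a content block in the request; everything up to and including that block becomes a cacheable prefix. The sketch below illustrates that prefix-marking idea in plain Python, mirroring the middleware's parameters. The helper `apply_prompt_caching` is a hypothetical name for illustration, not the library's actual internals:

```python
def apply_prompt_caching(messages, ttl="5m", min_messages_to_cache=0):
    """Mark the last content block of the final message as a cache breakpoint.

    Hypothetical sketch of the prefix-marking logic; the real middleware
    rewrites the request inside the agent's model-call hook.
    """
    # Skip caching until the conversation is long enough to be worth it.
    if len(messages) < min_messages_to_cache:
        return messages

    # Shallow-copy so the caller's message list is left untouched.
    messages = [dict(m) for m in messages]
    last = messages[-1]

    # Normalize string content into Anthropic's content-block form.
    blocks = last["content"]
    if isinstance(blocks, str):
        blocks = [{"type": "text", "text": blocks}]
    blocks = [dict(b) for b in blocks]

    # Everything up to and including this block becomes the cached prefix.
    blocks[-1]["cache_control"] = {"type": "ephemeral", "ttl": ttl}
    last["content"] = blocks
    messages[-1] = last
    return messages


messages = [
    {"role": "user", "content": "Summarize this long document ..."},
    {"role": "assistant", "content": [{"type": "text", "text": "Here is a summary."}]},
]
cached = apply_prompt_caching(messages, ttl="1h")
```

With `min_messages_to_cache` above the conversation length, the messages pass through unchanged, which matches the parameter's role of deferring caching for short conversations.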