Summarizes conversation history when token limits are approached.
This middleware monitors message token counts and automatically summarizes older messages once a threshold is reached. Recent messages are preserved, and context continuity is maintained by ensuring AI/Tool message pairs stay together.
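The behavior described above can be sketched in plain Python. This is a simplified illustration, not the library's implementation: the dict-based messages, the character-count token counter, and the `summarize` stub are all stand-ins.

```python
# Simplified sketch of threshold-triggered summarization (not the library's code).
# Messages are plain dicts here; the real middleware works on LangChain messages.

def count_tokens(messages):
    # Crude stand-in for a token counter: ~1 token per 4 characters.
    return sum(len(m["content"]) // 4 for m in messages)

def summarize(messages):
    # Stand-in for a model-generated summary message.
    return {"role": "system", "content": f"Summary of {len(messages)} earlier messages."}

def maybe_summarize(messages, max_tokens, keep_last):
    if count_tokens(messages) <= max_tokens:
        return messages
    cutoff = len(messages) - keep_last
    # Preserve AI/Tool pairs: if the cutoff would split a tool result from the
    # AI message that requested it, move the cutoff back past the tool message.
    while cutoff > 0 and messages[cutoff]["role"] == "tool":
        cutoff -= 1
    old, recent = messages[:cutoff], messages[cutoff:]
    if not old:
        return messages
    return [summarize(old)] + recent
```

Older messages collapse into a single summary message while the `keep_last` most recent messages survive verbatim, mirroring the `keep` policy described below.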
```python
SummarizationMiddleware(
    model: str | BaseChatModel,
    *,
    trigger: ContextSize | list[ContextSize] | None = None,
    keep: ContextSize = ('messages', _DEFAULT_MESSAGES_TO_KEEP),
    token_counter: TokenCounter = count_tokens_approximately,
    summary_prompt: str = DEFAULT_SUMMARY_PROMPT,
    trim_tokens_to_summarize: int | None = _DEFAULT_TRIM_TOKEN_LIMIT,
    **deprecated_kwargs: Any,
)
```

| Name | Type | Description |
|---|---|---|
| `model`* | `str \| BaseChatModel` | The language model used to generate summaries. |
| `trigger` | `ContextSize \| list[ContextSize] \| None` | Default: `None`. One or more thresholds that trigger summarization. Provide a single `ContextSize` or a list of them; summarization runs when any threshold is reached. |
| `keep` | `ContextSize` | Default: `('messages', _DEFAULT_MESSAGES_TO_KEEP)`. Context retention policy applied after summarization. Provide a single `ContextSize`; defaults to keeping the most recent `_DEFAULT_MESSAGES_TO_KEEP` messages. Unlike `trigger`, does not support multiple values. |
| `token_counter` | `TokenCounter` | Default: `count_tokens_approximately`. Function used to count tokens in messages. |
| `summary_prompt` | `str` | Default: `DEFAULT_SUMMARY_PROMPT`. Prompt template for generating summaries. |
| `trim_tokens_to_summarize` | `int \| None` | Default: `_DEFAULT_TRIM_TOKEN_LIMIT`. Maximum number of tokens to keep when preparing messages for the summarization call. Pass `None` to disable trimming. |
| Name | Type |
|---|---|
| `model` | `str \| BaseChatModel` |
| `trigger` | `ContextSize \| list[ContextSize] \| None` |
| `keep` | `ContextSize` |
| `token_counter` | `TokenCounter` |
| `summary_prompt` | `str` |
| `trim_tokens_to_summarize` | `int \| None` |
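The signature's default for `keep` shows that a `ContextSize` takes the tuple form `('messages', N)`. The following sketch of how a `trigger` value might be evaluated is an assumption for illustration (the `'tokens'` kind and the evaluation logic are not taken from the library's code):

```python
def should_summarize(messages, triggers, token_counter):
    """Hypothetical evaluator: return True when any (kind, limit) threshold is met.

    'messages' compares the message count; 'tokens' compares the counted total.
    """
    if triggers is None:
        return False
    if isinstance(triggers, tuple):  # a single ContextSize
        triggers = [triggers]
    for kind, limit in triggers:
        if kind == "messages" and len(messages) >= limit:
            return True
        if kind == "tokens" and token_counter(messages) >= limit:
            return True
    return False
```

Passing a list means summarization fires as soon as any one threshold is crossed, which matches the "one or more thresholds" wording above.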
Methods:

- Logic to run before the agent execution starts.
- Async logic to run before the agent execution starts.
- Logic to run after the model is called.
- Async logic to run after the model is called.
- Intercept and control model execution via handler callback.
- Intercept and control async model execution via handler callback.
- Logic to run after the agent execution completes.
- Async logic to run after the agent execution completes.
- Intercept tool execution for retries, monitoring, or modification.
- Intercept and control async tool execution via handler callback.
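The "intercept via handler callback" hooks above share a common shape: the middleware receives the request plus a `handler` that performs the underlying model or tool call, and may run logic before, after, or instead of invoking it. A minimal sketch of that pattern for the retry use case (function and parameter names here are illustrative, not the library's API):

```python
def retry_on_error(request, handler, attempts=3):
    # Middleware-style wrapper: invoke the underlying handler, retrying
    # a transient failure up to `attempts` times before giving up.
    last_exc = None
    for _ in range(attempts):
        try:
            return handler(request)
        except RuntimeError as exc:  # illustrative: retry only transient errors
            last_exc = exc
    raise last_exc
```

Because the wrapper controls whether and how often `handler` runs, the same shape supports monitoring (time the call), modification (rewrite `request` first), or short-circuiting (return without calling `handler` at all).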