Uses an LLM to select relevant tools before calling the main model.
When an agent has many tools available, this middleware filters them down to only the most relevant ones for the user's query. This reduces token usage and helps the main model focus on the right tools.
    LLMToolSelectorMiddleware(
        self,
        *,
        model: str | BaseChatModel | None = None,
        system_prompt: str = DEFAULT_SYSTEM_PROMPT,
        max_tools: int | None = None,
        always_include: list[str] | None = None
    )

| Name | Type | Description |
|---|---|---|
| `model` | `str \| BaseChatModel \| None` | Default: `None`. Model to use for selection. If not provided, uses the agent's main model. Can be a model identifier string or a `BaseChatModel` instance. |
| `system_prompt` | `str` | Default: `DEFAULT_SYSTEM_PROMPT`. Instructions for the selection model. |
| `max_tools` | `int \| None` | Default: `None`. Maximum number of tools to select. If the model selects more, only the first `max_tools` are kept. If not specified, there is no limit. |
| `always_include` | `list[str] \| None` | Default: `None`. Tool names to always include regardless of selection. These do not count against the `max_tools` limit. |
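For orientation, here is a minimal usage sketch showing how the middleware might be attached to an agent. The import paths (`langchain.agents.create_agent`, `langchain.agents.middleware.LLMToolSelectorMiddleware`), the model identifier strings, and the example tools are assumptions for illustration; adapt them to the package versions you have installed.

```python
# Sketch only: import paths, model identifiers, and tools are assumptions.
from langchain_core.tools import tool
from langchain.agents import create_agent
from langchain.agents.middleware import LLMToolSelectorMiddleware


@tool
def search(query: str) -> str:
    """Search the web for a query."""
    return f"results for {query}"


@tool
def calculator(expression: str) -> str:
    """Evaluate a simple arithmetic expression."""
    return str(eval(expression))


selector = LLMToolSelectorMiddleware(
    model="openai:gpt-4o-mini",   # smaller model performs the selection step
    max_tools=2,                  # cap how many tools reach the main model
    always_include=["search"],    # never filter out the search tool
)

agent = create_agent(
    model="openai:gpt-4o",        # main model only sees the selected tools
    tools=[search, calculator],   # in practice this list would be much longer
    middleware=[selector],
)

result = agent.invoke(
    {"messages": [{"role": "user", "content": "What is 2 + 2?"}]}
)
```

Using a smaller model for selection keeps the extra selection call cheap, while the main model's prompt stays short because it only carries the filtered tool schemas.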
- Start the shell session and run startup commands.
- Async start the shell session and run startup commands.
- Check model call limits before making a model call.
- Async check model call limits before making a model call.
- Check for parallel `write_todos` tool calls and return errors if detected.
- Async check for parallel `write_todos` tool calls and return errors if detected.
- Run shutdown commands and release resources when an agent completes.
- Async run shutdown commands and release resources when an agent completes.
- Intercept tool execution for retries, monitoring, or modification (see the sketch after this list).
- Intercept and control async tool execution via a handler callback.
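The last two summaries describe a handler-style hook wrapped around tool execution. The sketch below is illustrative only: it assumes a custom middleware can subclass an `AgentMiddleware` base class and override a `wrap_tool_call(request, handler)` method, where `handler(request)` performs the normal tool execution. The class name, hook signature, and retry policy are assumptions, not confirmed API.

```python
# Illustrative only: base class name, hook signature, and request/handler
# types are assumptions about the interception hook described above.
from langchain.agents.middleware import AgentMiddleware


class RetryToolMiddleware(AgentMiddleware):
    """Retry a failing tool call a few times before giving up."""

    def wrap_tool_call(self, request, handler):
        last_error = None
        for attempt in range(3):          # up to three attempts per tool call
            try:
                return handler(request)   # delegate to normal tool execution
            except Exception as exc:      # sketch-level error handling
                last_error = exc
        raise last_error                  # surface the final failure to the agent
```

The same pattern would apply to the async variant, with the override declared `async` and the handler awaited.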