Optional anthropic: Anthropic API key
Optional anthropic: Anthropic API URL
Optional api: Anthropic API key
Optional betas: Optional array of beta features to enable for the Anthropic API. Beta features are experimental capabilities that may change or be removed. See https://docs.claude.com/en/api/beta-headers for available beta features.
Optional client: Overridable Anthropic ClientOptions
Optional context: Configuration for context management. See https://docs.claude.com/en/docs/build-with-claude/context-editing
Optional create: Optional method that returns an initialized underlying Anthropic client. Useful for accessing Anthropic models hosted on other cloud services such as Google Vertex.
Optional inference: Specifies the geographic region for inference processing. US-only inference is available at 1.1x pricing for models released after February 1, 2026.
Optional invocation: Holds any additional parameters that are valid to pass to anthropic.messages that are not explicitly specified on this class.
Optional max: A maximum number of tokens to generate before stopping.
Optional model: Model name to use
Optional model
Optional output: Configuration options for the model's output, such as effort level and output format. The effort parameter controls how many tokens Claude uses when responding, trading off between response thoroughness and token efficiency.
Effort levels: "low", "medium", "high" (default), "max" (Opus 4.6 only).
Optional stop: A list of strings upon which to stop generating. You probably want ["\n\nHuman:"], as that's the cue for the next turn in the dialog agent.
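To make the effect of a stop list concrete, here is a minimal client-side sketch. It assumes nothing about how the service applies stop strings internally; `truncateAtStop` is a hypothetical helper, not part of any library, and only illustrates "stop at the earliest matching string".

```typescript
// Hypothetical helper (not a library function): cut `text` at the earliest
// occurrence of any stop string, mimicking what a stop list achieves.
function truncateAtStop(text: string, stopSequences: string[]): string {
  let cut = text.length;
  for (const stop of stopSequences) {
    if (stop.length === 0) continue; // an empty string would match at index 0
    const idx = text.indexOf(stop);
    if (idx !== -1 && idx < cut) cut = idx;
  }
  return text.slice(0, cut);
}

const rawCompletion = "Paris is the capital of France.\n\nHuman: And Spain?";
const stoppedCompletion = truncateAtStop(rawCompletion, ["\n\nHuman:"]);
console.log(stoppedCompletion);
```

With the stop list `["\n\nHuman:"]`, everything from the next "Human:" turn onward is dropped.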
Optional streaming: Whether to stream the results or not
Optional stream: Whether or not to include token usage data in streamed chunks.
Optional temperature: Amount of randomness injected into the response. Ranges from 0 to 1. Use temperature closer to 0 for analytical / multiple choice, and temperature closer to 1 for creative and generative tasks.
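The description does not say how temperature is applied internally. As an illustration only, the sketch below uses the conventional softmax temperature scaling (an assumption, not a documented detail of the Anthropic API) to show why values near 0 give near-deterministic output and values near 1 give more varied output. `softmaxWithTemperature` and the sample logits are hypothetical.

```typescript
// Hypothetical illustration: conventional softmax with temperature scaling.
function softmaxWithTemperature(logits: number[], temperature: number): number[] {
  if (!(temperature >= 0 && temperature <= 1)) {
    throw new RangeError("temperature must be between 0 and 1");
  }
  const maxLogit = Math.max(...logits);
  if (temperature === 0) {
    // Greedy limit: all probability mass on the first highest-scoring option.
    const best = logits.indexOf(maxLogit);
    return logits.map((_, i) => (i === best ? 1 : 0));
  }
  // Subtracting the maximum keeps Math.exp numerically stable.
  const exps = logits.map((l) => Math.exp((l - maxLogit) / temperature));
  const total = exps.reduce((sum, e) => sum + e, 0);
  return exps.map((e) => e / total);
}

const sampleLogits = [2, 1, 0];
console.log(softmaxWithTemperature(sampleLogits, 1));   // flatter distribution
console.log(softmaxWithTemperature(sampleLogits, 0.5)); // sharper distribution
console.log(softmaxWithTemperature(sampleLogits, 0));   // all mass on one option
```

Lowering the temperature concentrates probability on the highest-scoring option, which matches the guidance to use values near 0 for analytical tasks.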
Optional thinking: Options for extended thinking.
Optional top: Only sample from the top K options for each subsequent token. Used to remove "long tail" low probability responses.
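As a rough sketch of what "only sample from the top K options" means, the function below keeps the K most probable entries of a distribution and renormalizes them. `topKFilter` and the sample probabilities are hypothetical; this illustrates the general technique, not the service's actual implementation.

```typescript
// Hypothetical illustration of top-K filtering over a probability distribution.
function topKFilter(probs: number[], k: number): number[] {
  if (!Number.isInteger(k) || k < 1) {
    throw new RangeError("k must be a positive integer");
  }
  // Indices ordered by probability, highest first.
  const order = probs.map((_, i) => i).sort((a, b) => probs[b] - probs[a]);
  const keep = new Set(order.slice(0, k));
  // Zero out the "long tail", then renormalize the survivors.
  const kept = probs.map((p, i) => (keep.has(i) ? p : 0));
  const total = kept.reduce((sum, p) => sum + p, 0);
  return kept.map((p) => p / total);
}

const topKExample = topKFilter([0.5, 0.3, 0.15, 0.05], 2);
console.log(topKExample);
```

With K = 2, the two least likely options receive zero probability and can no longer be sampled.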
Optional top: Uses nucleus sampling, in which the cumulative distribution over all the options for each subsequent token is computed in decreasing probability order and cut off once it reaches the probability specified by top_p. Alter either temperature or top_p, but not both.
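The nucleus sampling cutoff described above can be sketched as follows. `topPFilter` and the sample distribution are hypothetical names used for illustration; the code follows the description (sort by decreasing probability, accumulate, cut off once the running total reaches top_p) and is not taken from any library.

```typescript
// Hypothetical illustration of the nucleus (top-p) cutoff.
function topPFilter(probs: number[], topP: number): number[] {
  if (!(topP > 0 && topP <= 1)) {
    throw new RangeError("topP must be in (0, 1]");
  }
  // Visit options in decreasing probability order.
  const order = probs.map((_, i) => i).sort((a, b) => probs[b] - probs[a]);
  const kept = new Array<number>(probs.length).fill(0);
  let cumulative = 0;
  for (const i of order) {
    kept[i] = probs[i];
    cumulative += probs[i];
    if (cumulative >= topP) break; // cut off once the cumulative mass reaches topP
  }
  // Renormalize the retained options so they sum to 1.
  return kept.map((p) => p / cumulative);
}

const nucleusExample = topPFilter([0.5, 0.3, 0.15, 0.05], 0.7);
console.log(nucleusExample);
```

Here the first two options together reach 0.8, which passes the 0.7 threshold, so only they remain candidates.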
Input to AnthropicChat class.