Interface●Since v1.0

ChatWatsonxGatewayInput

interface ChatWatsonxGatewayInput

Bases

BaseChatModelParamsWatsonxGatewayChatParams

Properties

property

cache: boolean | BaseCache<Generation[]>

property

callbacks: Callbacks

property

configurable: Record<string, any>

Runtime values for attributes previously made configurable on this Runnable, or sub-Runnables.

property

disableStreaming: boolean

property

frequencyPenalty: number

Penalizes repeated tokens according to frequency

property

headers: OutgoingHttpHeaders

property

logitBias: JSONObject

Dictionary used to adjust the probability of specific tokens being generated

property

logprobs: boolean

Whether to return log probabilities of the output tokens or not. If true, returns the log probabilities of each output token returned in the content of message.

property

ls_structured_output_format: __type

Describes the format of structured outputs. This should be provided if an output is considered to be structured

property

maxCompletionTokens: number

The maximum number of tokens that can be generated in the chat completion. The total length of input tokens and generated tokens is limited by the model's context length. Set to 0 for the model's configured max generated tokens.

property

maxConcurrency: number

The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.

property

maxRetries: number

The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.

property

maxTokens: number

property

metadata: Record<string, unknown>

property

model: string

Model name to use. Available options are: qwen-turbo, qwen-plus, qwen-max, or Other compatible models.

property

modelGateway: boolean

property

modelGatewayKwargs: WatsonxGatewayChatKwargs

property

n: number

Number of completions to generate for each prompt

property

onFailedAttempt: FailedAttemptHandler

Custom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.

property

outputVersion: MessageOutputVersion

property

presencePenalty: number

Penalizes repeated tokens

property

promptIndex: number

property

reasoningEffort: "low" | "medium" | "high"

A lower reasoning effort can result in faster responses, fewer tokens used, and shorter reasoning_content in the responses. Supported values are low, medium, and high.

property

recursionLimit: number

Maximum number of times a call can recurse. If not provided, defaults to 25.

property

responseFormat: ChatsResponseFormat

The tool response format.

If "content" then the output of the tool is interpreted as the contents of a ToolMessage. If "content_and_artifact" then the output is expected to be a two-tuple corresponding to the (content, artifact) of a ToolMessage.

property

runId: string

Unique identifier for the tracer run for this call. If not provided, a new UUID will be generated.

property

runName: string

Name for the tracer run for this call. Defaults to the name of the class.

property

seed: number

property

serviceUrl: string

property

signal: AbortSignal

Abort signal for this call. If provided, the call will be aborted when the signal is aborted.

property

stop: string[]

Stop tokens to use for this call. If not provided, the default stop tokens for the model will be used.

property

streaming: boolean

Whether to stream the results or not. Defaults to false.

property

tags: string[]

property

temperature: number

Amount of randomness injected into the response. Ranges from 0 to 1 (0 is not included). Use temp closer to 0 for analytical / multiple choice, and temp closer to 1 for creative and generative tasks. Defaults to 0.95.

property

timeout: number

Timeout for this call in milliseconds.

property

tool_choice: WatsonxTooChoice

Specifies how the chat model should use tools.

property

tools: ChatsRequestTool[]

property

topLogprobs: number

An integer between 0 and 5 specifying the number of most likely tokens to return at each token position, each with an associated log probability. logprobs must be set to true if this parameter is used.

property

topP: number

Total probability mass of tokens to consider at each step. Range from 0 to 1.0. Defaults to 0.8.

property

verbose: boolean

Whether to print out response text.

property

version: string

property

watsonxCallbacks: RequestCallbacks<any>

deprecatedproperty

callbackManager: CallbackManager

Inherited fromWatsonxGatewayChatParams

Properties

Pcallbacks: Callbacks Pconfigurable: Record<string, any>

—

Runtime values for attributes previously made configurable on this Runnable,

PfrequencyPenalty: number

—

Penalizes repeated tokens according to frequency

Pheaders: Record<string, string>PlogitBias: Record<string, number>

—

Dictionary used to adjust the probability of specific tokens being generated

Plogprobs: boolean

—

Whether to return log probabilities of the output tokens or not.

Pls_structured_output_format: __type

—

Describes the format of structured outputs.

PmaxCompletionTokens: number

—

The maximum number of tokens that can be generated in the chat completion. The total length of input tokens

PmaxConcurrency: number

—

The maximum number of concurrent calls that can be made.

PmaxRetries: number

—

The maximum number of retries that can be made for a single call,

—

Model name to use. Available options are: qwen-turbo, qwen-plus, qwen-max, or Other compatible models.

PmodelGatewayKwargs: WatsonxGatewayChatKwargs Pn: number

—

Number of completions to generate for each prompt

PoutputVersion: MessageOutputVersion PpresencePenalty: number

—

Penalizes repeated tokens

PpromptIndex: number PreasoningEffort: "low" | "medium" | "high"

—

A lower reasoning effort can result in faster responses, fewer tokens used, and shorter reasoning_content in the responses.

PrecursionLimit: number

—

Maximum number of times a call can recurse. If not provided, defaults to 25.

PresponseFormat: string

—

The tool response format.

PrunId: string

—

Unique identifier for the tracer run for this call. If not provided, a new UUID

PrunName: string

—

Name for the tracer run for this call. Defaults to the name of the class.

Pseed: number PserviceUrl: string Psignal: AbortSignal

—

Abort signal for this call.

Pstop: string[]

—

Stop tokens to use for this call.

Pstreaming: boolean

—

Whether to stream the results or not. Defaults to false.

Ptags: string[]Ptemperature: number

—

Amount of randomness injected into the response. Ranges

Ptimeout: number

—

Timeout for this call in milliseconds.

Ptool_choice: ToolChoice

—

Specifies how the chat model should use tools.

Ptools: ToolInterface<StringInputToolSchema, any, any>[]PtopLogprobs: number

—

An integer between 0 and 5 specifying the number of most likely tokens to return at each token position,

PtopP: number

—

Total probability mass of tokens to consider at each step. Range

Pversion: string PwatsonxCallbacks: RequestCallbacks<any>

Inherited fromWatsonxCallOptionsGatewayChat

Properties

Pcallbacks: Callbacks Pconfigurable: Record<string, any>

—

Runtime values for attributes previously made configurable on this Runnable,

PfrequencyPenalty: number

—

Penalizes repeated tokens according to frequency

Pheaders: Record<string, string>PlogitBias: Record<string, number>

—

Dictionary used to adjust the probability of specific tokens being generated

Plogprobs: boolean

—

Whether to return log probabilities of the output tokens or not.

Pls_structured_output_format: __type

—

Describes the format of structured outputs.

PmaxCompletionTokens: number

—

The maximum number of tokens that can be generated in the chat completion. The total length of input tokens

PmaxConcurrency: number

—

The maximum number of concurrent calls that can be made.

PmaxRetries: number

—

The maximum number of retries that can be made for a single call,

—

Model name to use. Available options are: qwen-turbo, qwen-plus, qwen-max, or Other compatible models.

PmodelGatewayKwargs: WatsonxGatewayChatKwargs Pn: number

—

Number of completions to generate for each prompt

PoutputVersion: MessageOutputVersion PpresencePenalty: number

—

Penalizes repeated tokens

PpromptIndex: number PreasoningEffort: "low" | "medium" | "high"

—

A lower reasoning effort can result in faster responses, fewer tokens used, and shorter reasoning_content in the responses.

PrecursionLimit: number

—

Maximum number of times a call can recurse. If not provided, defaults to 25.

PresponseFormat: string

—

The tool response format.

PrunId: string

—

Unique identifier for the tracer run for this call. If not provided, a new UUID

PrunName: string

—

Name for the tracer run for this call. Defaults to the name of the class.

Pseed: number Psignal: AbortSignal

—

Abort signal for this call.

Pstop: string[]

—

Stop tokens to use for this call.

Pstreaming: boolean

—

Whether to stream the results or not. Defaults to false.

Ptags: string[]Ptemperature: number

—

Amount of randomness injected into the response. Ranges

Ptimeout: number

—

Timeout for this call in milliseconds.

Ptool_choice: ToolChoice

—

Specifies how the chat model should use tools.

Ptools: ToolInterface<StringInputToolSchema, any, any>[]PtopLogprobs: number

—

An integer between 0 and 5 specifying the number of most likely tokens to return at each token position,

PtopP: number

—

Total probability mass of tokens to consider at each step. Range

PwatsonxCallbacks: RequestCallbacks<any>

View source on GitHub

ChatWatsonxGatewayInput

Bases

Properties

Inherited fromWatsonxGatewayChatParams

Properties

Inherited fromWatsonxCallOptionsGatewayChat

Properties

LangChain Assistant

Menu

ChatWatsonxGatewayInput

Bases

Properties

Inherited fromWatsonxGatewayChatParams

Properties

Inherited fromWatsonxCallOptionsGatewayChat

Properties