Interface●Since v1.0

WatsonxCallDeployedParams

interface WatsonxCallDeployedParams

Bases

DeploymentsTextChatParams

Properties

property

context: string

property

frequencyPenalty: number

Penalizes repeated tokens according to frequency

property

headers: OutgoingHttpHeaders

property

idOrName: string

The id_or_name can be either the deployment_id that identifies the deployment or a serving_name that allows a predefined URL to be used to post a prediction. The deployment must reference a prompt template with input_mode chat.

The WML instance that is associated with the deployment will be used for limits and billing (if a paid plan).

property

includeReasoning: boolean

Whether to include reasoning_content in the response. Default is true.

property

lengthPenalty: number

property

logitBias: JsonObject

Dictionary used to adjust the probability of specific tokens being generated

property

logprobs: boolean

Whether to return log probabilities of the output tokens or not. If true, returns the log probabilities of each output token returned in the content of message.

property

maxCompletionTokens: number

The maximum number of tokens that can be generated in the chat completion. The total length of input tokens and generated tokens is limited by the model's context length. Set to 0 for the model's configured max generated tokens.

property

maxTokens: number

property

messages: DeploymentTextChatMessages[]

property

n: number

Number of completions to generate for each prompt

property

presencePenalty: number

Penalizes repeated tokens

property

reasoningEffort: "low" | "medium" | "high"

A lower reasoning effort can result in faster responses, fewer tokens used, and shorter reasoning_content in the responses. Supported values are low, medium, and high.

property

repetitionPenalty: number

Penalizes repeated tokens according to frequency. Range from 1.0 to 2.0. Defaults to 1.0.

property

responseFormat: TextChatResponseFormat

The tool response format.

If "content" then the output of the tool is interpreted as the contents of a ToolMessage. If "content_and_artifact" then the output is expected to be a two-tuple corresponding to the (content, artifact) of a ToolMessage.

property

seed: number

property

signal: AbortSignal

Abort signal for this call. If provided, the call will be aborted when the signal is aborted.

property

stop: string[]

Stop tokens to use for this call. If not provided, the default stop tokens for the model will be used.

property

temperature: number

Amount of randomness injected into the response. Ranges from 0 to 1 (0 is not included). Use temp closer to 0 for analytical / multiple choice, and temp closer to 1 for creative and generative tasks. Defaults to 0.95.

property

timeLimit: number

Time limit in milliseconds - if not completed within this time, generation will stop. The text generated so far will be returned along with the `TIME_LIMIT`` stop reason. Depending on the users plan, and on the model being used, there may be an enforced maximum time limit.

property

toolChoice: TextChatToolChoiceTool

property

toolChoiceOption: string

property

tools: TextChatParameterTools[]

property

topLogprobs: number

An integer between 0 and 5 specifying the number of most likely tokens to return at each token position, each with an associated log probability. logprobs must be set to true if this parameter is used.

property

topP: number

Total probability mass of tokens to consider at each step. Range from 0 to 1.0. Defaults to 0.8.

View source on GitHub

Properties

property

context: string

property

frequencyPenalty: number

Penalizes repeated tokens according to frequency

property

headers: OutgoingHttpHeaders

property

idOrName: string

The WML instance that is associated with the deployment will be used for limits and billing (if a paid plan).

property

includeReasoning: boolean

Whether to include reasoning_content in the response. Default is true.

property

lengthPenalty: number

property

logitBias: JsonObject

Dictionary used to adjust the probability of specific tokens being generated

property

logprobs: boolean

Whether to return log probabilities of the output tokens or not. If true, returns the log probabilities of each output token returned in the content of message.

property

maxCompletionTokens: number

property

maxTokens: number

property

messages: DeploymentTextChatMessages[]

property

n: number

Number of completions to generate for each prompt

property

presencePenalty: number

Penalizes repeated tokens

property

reasoningEffort: "low" | "medium" | "high"

A lower reasoning effort can result in faster responses, fewer tokens used, and shorter reasoning_content in the responses. Supported values are low, medium, and high.

property

repetitionPenalty: number

Penalizes repeated tokens according to frequency. Range from 1.0 to 2.0. Defaults to 1.0.

property

responseFormat: TextChatResponseFormat

The tool response format.

property

seed: number

property

signal: AbortSignal

Abort signal for this call. If provided, the call will be aborted when the signal is aborted.

property

stop: string[]

Stop tokens to use for this call. If not provided, the default stop tokens for the model will be used.

property

temperature: number

property

timeLimit: number

property

toolChoice: TextChatToolChoiceTool

property

toolChoiceOption: string

property

tools: TextChatParameterTools[]

property

topLogprobs: number

property

topP: number

Total probability mass of tokens to consider at each step. Range from 0 to 1.0. Defaults to 0.8.

WatsonxCallDeployedParams

Bases

Properties

LangChain Assistant

Menu

WatsonxCallDeployedParams

Bases

Properties