Parameters for a basic LLM invoke.
interface WatsonxDeployedInputLLM
Hierarchy: WatsonxLLMBasicOptions, WatsonxDeploymentLLMParams

idOrName
The id_or_name can be either the deployment_id that identifies the deployment or a serving_name that allows a predefined URL to be used to post a prediction. The deployment must reference a prompt template with input_mode chat.
The WML instance associated with the deployment is used for limits and billing (on a paid plan).
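
For orientation, a minimal construction sketch in TypeScript. It assumes the WatsonxLLM class exported from @langchain/community/llms/ibm accepts idOrName alongside the usual version and serviceUrl fields, and that credentials come from the WATSONX_AI_* environment variables; the URL and deployment id below are placeholders.

import { WatsonxLLM } from "@langchain/community/llms/ibm";

// Placeholders: substitute your real service URL and either a
// deployment_id or a serving_name for idOrName.
const llm = new WatsonxLLM({
  version: "2024-05-31",
  serviceUrl: "https://us-south.ml.cloud.ibm.com",
  idOrName: "<deployment_id-or-serving_name>",
});

const answer = await llm.invoke("Print hello world.");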
maxConcurrency
The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.
maxRetries
The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
onFailedAttempt
Custom handler for failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
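
Taken together, these three options bound how calls are throttled and retried. A sketch under the same assumptions as above; the limits and the 401 check are illustrative choices, not library defaults.

import { WatsonxLLM } from "@langchain/community/llms/ibm";

const throttledLlm = new WatsonxLLM({
  version: "2024-05-31",
  serviceUrl: "https://us-south.ml.cloud.ibm.com",
  idOrName: "<deployment_id-or-serving_name>",
  maxConcurrency: 2, // at most two requests in flight at a time
  maxRetries: 3,     // lower than the default of 6
  onFailedAttempt: (error) => {
    // Rethrowing marks the failure as non-retryable; treating auth
    // errors as permanent is an illustrative choice, not library behavior.
    if (error.message.includes("401")) throw error;
  },
});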
streaming
Whether to stream the results or not. Defaults to false.
verbose
Whether to print out response text.
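
Finally, a streaming sketch under the same assumptions, using the standard LangChain .stream() method, which for LLMs returns an async-iterable of string chunks.

import { WatsonxLLM } from "@langchain/community/llms/ibm";

const streamingLlm = new WatsonxLLM({
  version: "2024-05-31",
  serviceUrl: "https://us-south.ml.cloud.ibm.com",
  idOrName: "<deployment_id-or-serving_name>",
  streaming: true, // stream results rather than return one block of text
  verbose: true,   // also print response text as it is produced
});

// Consume the stream chunk by chunk as tokens arrive.
const stream = await streamingLlm.stream("Print hello world.");
for await (const chunk of stream) {
  process.stdout.write(chunk);
}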