Constructor input interfaces for each mode
interface WatsonxInputLLM
WatsonxLLMBasicOptions
WatsonxLLMParams
The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.
The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
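To make the exponential backoff concrete, here is a minimal sketch of how the wait before each retry could grow. The 1000 ms base delay and the doubling factor are assumptions chosen for illustration, since the text above does not state the actual values, and `backoffDelays` is a hypothetical helper rather than part of the library.

```typescript
// Hypothetical helper: computes the wait (in ms) before each retry attempt.
// baseMs and factor are illustrative assumptions, not documented library values.
function backoffDelays(
  maxRetries: number = 6,
  baseMs: number = 1000,
  factor: number = 2
): number[] {
  const delays: number[] = [];
  for (let attempt = 0; attempt < maxRetries; attempt++) {
    // Each retry waits `factor` times longer than the previous one.
    delays.push(baseMs * Math.pow(factor, attempt));
  }
  return delays;
}

// One delay per allowed retry, using the default of 6 retries.
console.log(backoffDelays());
```

With these assumed values, the total wait grows quickly, which is why a modest retry limit is usually enough.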
Model name to use. Available options are: qwen-turbo, qwen-plus, qwen-max, or other compatible models.
Custom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
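The contract described above (the handler receives the thrown error and rethrows when it is not retryable) can be sketched as follows. This is a simplified, synchronous illustration: `callWithRetries`, the `status` field on the error, and the rule that 4xx statuses other than 429 are permanent are all assumptions, and the waiting between attempts is left out.

```typescript
// Type of the failure handler: receives the originally thrown error.
type FailedAttemptHandler = (error: unknown) => void;

// Hypothetical handler: treats 4xx statuses (except 429) as not retryable.
const onFailedAttempt: FailedAttemptHandler = (error) => {
  const status = (error as { status?: number } | null)?.status;
  if (status !== undefined && status >= 400 && status < 500 && status !== 429) {
    throw error; // Rethrowing tells the retry loop to give up.
  }
};

// Hypothetical retry loop: one initial call plus up to maxRetries retries.
function callWithRetries<T>(
  fn: () => T,
  maxRetries: number,
  handler: FailedAttemptHandler
): T {
  let lastError: unknown;
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    try {
      return fn();
    } catch (error) {
      handler(error); // Throws for non-retryable errors, ending the loop.
      lastError = error;
    }
  }
  throw lastError; // All attempts failed with retryable errors.
}
```

A call that fails with a transient error is retried until it succeeds or the retry budget runs out, while an authentication error stops after the first attempt.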
The project that contains the resource. Either space_id or project_id has to be given.
Penalizes repeated tokens according to frequency. Ranges from 1.0 to 2.0. Defaults to 1.0.
Abort signal for this call. If provided, the call will be aborted when the signal is aborted.
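As a rough illustration of how an abort signal interrupts work, the sketch below uses the standard `AbortController` and `AbortSignal` APIs. The `generateWithSignal` function is a hypothetical stand-in for a model call and only mimics the behavior described above.

```typescript
// Hypothetical stand-in for a model call that checks the signal between chunks.
function generateWithSignal(chunks: string[], signal?: AbortSignal): string {
  let text = "";
  for (const chunk of chunks) {
    if (signal?.aborted) {
      throw new Error("Call aborted");
    }
    text += chunk;
  }
  return text;
}

const controller = new AbortController();
controller.abort(); // In practice this is triggered by a timeout or a user action.

try {
  generateWithSignal(["Hello", ", world"], controller.signal);
} catch (error) {
  console.log((error as Error).message);
}
```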
The space that contains the resource. Either space_id or project_id has to be given.
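The requirement that either `space_id` or `project_id` is given could be checked before a call with a small guard like the one below. `ResourceScope` and `assertScope` are hypothetical names used only for this sketch; the library may validate its input differently.

```typescript
// Hypothetical shape holding the two identifiers named above.
interface ResourceScope {
  project_id?: string;
  space_id?: string;
}

// Throws when neither identifier is present.
function assertScope(scope: ResourceScope): void {
  if (!scope.project_id && !scope.space_id) {
    throw new Error("Either space_id or project_id has to be given.");
  }
}

assertScope({ project_id: "my-project" }); // passes
```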
Whether to stream the results or not. Defaults to false.
Amount of randomness injected into the response. Ranges from 0 to 1 (0 is not included). Use a temperature closer to 0 for analytical or multiple-choice tasks, and closer to 1 for creative and generative tasks. Defaults to 0.95.
Time limit in milliseconds. If generation is not completed within this time, it will stop, and the text generated so far will be returned along with the `TIME_LIMIT` stop reason. Depending on the user's plan and on the model being used, there may be an enforced maximum time limit.
Total probability mass of tokens to consider at each step. Ranges from 0 to 1.0. Defaults to 0.8.
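The idea of limiting sampling to a total probability mass is commonly called nucleus (top-p) sampling. The sketch below shows one common way to pick the candidate tokens: sort by probability and keep tokens until their cumulative probability reaches the threshold. `topPFilter` and the example distribution are hypothetical, and the service's actual implementation may differ.

```typescript
// Hypothetical helper: returns the smallest set of most likely tokens
// whose cumulative probability reaches topP.
function topPFilter(probs: Record<string, number>, topP: number = 0.8): string[] {
  // Most likely tokens first.
  const sorted = Object.entries(probs).sort((a, b) => b[1] - a[1]);
  const kept: string[] = [];
  let cumulative = 0;
  for (const [token, p] of sorted) {
    kept.push(token);
    cumulative += p;
    if (cumulative >= topP) break; // Enough probability mass collected.
  }
  return kept;
}

// Made-up next-token distribution for illustration.
const nextToken = { the: 0.5, a: 0.3, this: 0.15, that: 0.05 };
console.log(topPFilter(nextToken, 0.9));
```

A lower threshold keeps fewer candidates and makes output more predictable, while a value near 1.0 leaves almost the whole distribution available.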
Whether to print out response text.