class PremEmbeddingsPrompt processing batch size.
The async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.
The Momento cache client.
Model name to use. Available options are: qwen-turbo, qwen-plus, qwen-max, or Other compatible models.
Class for generating embeddings using the Prem AI's API. Extends the Embeddings class and implements PremEmbeddingsParams and