class HuggingFaceInferenceEmbeddingsThe async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.
The Momento cache client.
Model name to use. Available options are: qwen-turbo, qwen-plus, qwen-max, or Other compatible models.
Class that extends the Embeddings class and provides methods for generating embeddings using Hugging Face models through the HuggingFaceInference API.