LocalAI embedding models.
Javelin AI Gateway embeddings.
Fake embedding model.
Fake embedding model that always returns
Custom exception for interfacing with Takeoff Embedding class.
Exception raised when no consumer group is provided on initialization of
Device to use for inference, cuda or cpu.
Configuration for the reader to be deployed in Takeoff.
Interface with Takeoff Inference API for embedding models.
Content handler for LLM class.
Custom Sagemaker Inference Endpoints.
Google's PaLM Embeddings APIs.
Tencent Hunyuan embedding models API.
NCP ClovaStudio Embedding API.
OCI authentication types as an enumerator.
OCI embedding models.
Payload for the Embaas embeddings API.
Embaas's embedding service.
MiniMax embedding model integration.
JohnSnowLabs embedding models.
Baichuan Text Embedding models.
URL class for parsing the URL.
SparkLLM embedding model integration.
Exception raised for errors in the header assembly.
MLflow AI Gateway embeddings.
EdenAI embedding.
NLP Cloud embedding models.
MosaicML embedding service.
TensorflowHub embedding models.
Embedding LLMs in MLflow.
Cohere embedding LLMs in MLflow.
Embeddings by spaCy models.
llama.cpp embedding models.
Anyscale Embeddings API.
Prem's Embedding APIs.
Volcengine embedding models.
OctoAI Compute Service embedding models.
OpenVINO embedding models.
OpenVINO BGE embedding models.
Embedding documents and queries with Awa DB.
ModelScopeHub embedding models.
Ascend NPU-accelerated embedding model.
text2vec embedding models.
Jina embedding models.
HuggingFace embedding models on self-hosted remote hardware.
HuggingFace InstructEmbedding models on self-hosted remote hardware.
Custom embedding models on self-hosted remote hardware.
Bookend AI sentence_transformers embedding models.
ZhipuAI embedding model integration.
LLMRails embedding models.
YandexGPT Embeddings models.
GPT4All embedding models.
Gradient.ai Embedding models.
Deprecated, TinyAsyncGradientEmbeddingClient was removed.
Llamafile lets you distribute and run large language models with a
OVHcloud AI Endpoints Embeddings.
Model2Vec embedding models.
Quantized bi-encoders embedding models.
A class to handle embedding requests to the TextEmbed API.
A client to handle synchronous and asynchronous requests to the TextEmbed API.
Optimized Infinity embedding models.
DashScope embedding models.
Qdrant FastEmbedding models.
Xinference embedding models.
Baidu Qianfan embedding models.
Self-hosted embedding models for the infinity package.
Helper tool to embed Infinity.
Deep Infra's embedding inference service.
Leverage Itrex runtime to unlock the performance of compressed NLP models.
Aleph Alpha's asymmetric semantic embedding.
Symmetric version of Aleph Alpha's semantic embeddings.
Clarifai embedding models.
LASER Language-Agnostic SEntence Representations.
Wrapper around the BGE embedding model
GigaChat Embeddings models.
Get embeddings.
NeMo embedding models.
Cohere embedding models.
Elasticsearch embedding models.
OpenAI embedding models.
SambaNova embedding models.
HuggingFace sentence_transformers embedding models.
Wrapper around sentence_transformers embedding models.
HuggingFace sentence_transformers embedding models.
Embed texts using the HuggingFace API.
Ernie Embeddings V1 embedding models.
Google Cloud VertexAI embedding models.
Solar's embedding service.
Ollama locally runs large language models.
Bedrock embedding models.
Voyage embedding models.
HuggingFaceHub embedding models.
Databricks embeddings.
Cloudflare Workers AI embedding model.
Clova's embedding service.
Azure OpenAI Embeddings API.
Use tenacity to retry the embedding call.
Use tenacity to retry the embedding call.
Check if an endpoint is live by sending a GET request to the specified URL.
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Use tenacity to retry the embedding call.
Use tenacity to retry the embedding call.
Create a retry decorator for PremAIEmbeddings.
Use tenacity for retries in embedding calls.
Use tenacity to retry the completion call.
Check if a URL is a local file.
Get the bytes string of a file.
Load the embedding model.
Use tenacity to retry the embedding call.
Use tenacity to retry the embedding call.
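
Several of the helpers listed above wrap an embedding call in a retry. The sketch below illustrates that general pattern (retry on transient errors with exponential backoff) using only the standard library. It is not tenacity and not the code of any integration listed here; `retry_embedding_call`, `flaky_embed`, and all parameter names are hypothetical names chosen for illustration.

```python
import time
from typing import Callable, List, Tuple, Type

Embedder = Callable[[List[str]], List[List[float]]]


def retry_embedding_call(
    func: Embedder,
    max_retries: int = 3,
    base_delay: float = 1.0,
    retry_on: Tuple[Type[BaseException], ...] = (ConnectionError, TimeoutError),
) -> Embedder:
    """Wrap `func` so failures listed in `retry_on` are retried with backoff.

    Hypothetical helper: a hand-rolled stand-in for what a retry decorator
    provides, not a real library API.
    """
    if max_retries < 1:
        raise ValueError("max_retries must be at least 1")

    def wrapper(texts: List[str]) -> List[List[float]]:
        for attempt in range(1, max_retries + 1):
            try:
                return func(texts)
            except retry_on:
                if attempt == max_retries:
                    raise  # out of attempts: surface the last error
                # Wait base_delay, 2*base_delay, 4*base_delay, ... between tries.
                time.sleep(base_delay * 2 ** (attempt - 1))
        raise RuntimeError("unreachable")

    return wrapper


# Simulated embedding backend that fails twice, then succeeds.
calls = {"count": 0}


def flaky_embed(texts: List[str]) -> List[List[float]]:
    calls["count"] += 1
    if calls["count"] < 3:
        raise ConnectionError("simulated transient failure")
    # Toy one-dimensional "embedding": the length of each text.
    return [[float(len(t))] for t in texts]


# base_delay=0.0 keeps the demonstration instantaneous.
embed_with_retry = retry_embedding_call(flaky_embed, max_retries=3, base_delay=0.0)
vectors = embed_with_retry(["hello", "hi"])
```

Errors outside `retry_on` propagate immediately, so a non-transient failure such as a bad request is not retried.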