GigaChat large language models API.
Base class of Friendli.
Friendli LLM.
Parameters for the Javelin AI Gateway LLM.
Javelin AI Gateway LLMs.
Writer large language models.
Parameters for AI21 penalty data.
AI21 large language models.
Fake LLM for testing purposes.
Fake streaming list LLM for testing purposes.
Banana large language models.
HazyResearch's Manifest library.
Wrapper around You.com's conversational Smart and Research APIs.
Weight-only quantized model.
The device to use for inference, cuda or cpu.
Configuration for the reader to be deployed in Titan Takeoff API.
Titan Takeoff API LLMs.
Yi large language models.
Parse the byte stream input.
Handler class to transform input from the LLM into the format the endpoint expects.
Content handler for LLM class.
Use your Predibase models with Langchain.
Aphrodite language model.
MLX Pipeline API.
Parameters for the MLflow AI Gateway LLM.
MLflow AI Gateway LLMs.
Nebula Service models.
OCI authentication types as enumerator.
Base class for OCI GenAI models.
OCI large language models.
AzureML Managed Endpoint client.
Azure ML endpoints API types. Use dedicated for models deployed in hosted infrastructure, or serverless for pay-as-you-go deployments.
Transform the request and response of an AzureML endpoint to match the required schema.
Content handler for GPT2.
Deprecated: Kept for backwards compatibility.
Content handler for LLMs from the HuggingFace catalog.
Content handler for the Dolly-v2-12b model.
Content formatter for models that use the OpenAI-like API scheme.
Deprecated: Kept for backwards compatibility.
Azure ML Online Endpoint models.
Azure ML Online Endpoint models.
Common parameters for Minimax large language models.
Minimax large language models.
Common parameters for Moonshot LLMs.
Moonshot large language models.
Base OpenAI large language model class.
LangChain LLM class to help access the EAS LLM service.
Baichuan large language models.
iFlyTek Spark completion model integration.
EdenAI models.
SambaStudio large language models.
SambaNova Cloud large language models.
NLPCloud large language models.
MosaicML LLM service.
GooseAI large language models.
Replicate models.
MLflow LLM service.
LLM that uses OpaquePrompts to sanitize prompts.
llama.cpp model.
Anyscale large language models.
User input as the response.
Petals Bloom models.
Aviary backend.
Aviary hosted models.
ForefrontAI large language models.
Layerup Security LLM service.
Kobold API language model.
ExllamaV2 API.
Modal large language models.
NIBittensor LLMs.
Common configuration for Solar LLMs.
Solar large language models.
Raised when the Ollama endpoint is not found.
ChatGLM3 LLM service.
Base class for VolcEngineMaas models.
VolcEngine MaaS hosts a plethora of models.
Neural Magic DeepSparse LLM interface.
HuggingFace Pipeline API to run on self-hosted remote hardware.
Adapter class to prepare inputs from LangChain into the format the LLM expects.
Base class for Bedrock models.
Baseten model.
Model inference on self-hosted remote hardware.
Konko AI models.
CTranslate2 language model.
PipelineAI large language models.
Tongyi completion model integration.
Cloudflare Workers AI service.
Text generation models from WebUI.
Yandex large language models.
GPT4All language models.
Training result.
Gradient.ai LLM Endpoints.
Llamafile lets you distribute and run large language models with a single file.
StochasticAI large language models.
OctoAI LLM Endpoints - OpenAI compatible.
RWKV language models.
LLM wrapper for the Outlines library.
CerebriumAI large language models.
OpenLM models.
Wrapper around the BigdlLLM model.
Adapter to prepare inputs from LangChain into the format the LLM expects.
Amazon API Gateway to access LLM models hosted on AWS.
Xinference large-scale model inference service.
Baidu Qianfan completion model integration.
VLLM language model.
vLLM OpenAI-compatible API client.
Beam API for the GPT-2 large language model.
DeepInfra models.
OpenAI-compatible API client for an OpenLLM server.
Raised when the token has expired.
Raised when a server error is encountered during inference.
Base class for LLM deployed on OCI Data Science Model Deployment.
LLM deployed on OCI Data Science Model Deployment.
OCI Data Science Model Deployment TGI Endpoint.
VLLM deployed on OCI Data Science Model Deployment.
ChatGLM LLM service.
Arcee's Domain Adapted Language Models (DALMs).
Yuan2.0 language models.
C Transformers LLM models.
Aleph Alpha large language models.
Clarifai large language models.
PromptLayer OpenAI large language models.
PromptLayer OpenAI large language models.
IpexLLM model.
Base class for Cohere models.
Cohere large language models.
SageMaker Inference Endpoint models.
DEPRECATED: Use langchain_google_genai.GoogleGenerativeAI instead.
OpenAI large language models.
Azure-specific OpenAI large language models.
OpenAI Chat large language models.
HuggingFace text generation API.
Fireworks models.
Google Vertex AI large language models.
Vertex AI Model Garden large language models.
Ollama locally runs large language models.
Prediction Guard large language models.
HuggingFace Endpoint.
Bedrock models.
HuggingFaceHub models.
Databricks serving endpoint or a cluster driver proxy app for LLM.
HuggingFace Pipeline API.
IBM watsonx.ai large language models.
Large language models from Together.
Anthropic large language models.
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Load LLM from Config Dict.
Load LLM from a file.
Use tenacity to retry the completion call.
Generate text from the model.
Use tenacity to retry the completion call.
Cut off the text as soon as any stop words occur.
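The stop-word cutoff described in the entry above can be sketched as a small helper; the function name and the regex-split approach are illustrative assumptions, not necessarily the library's exact implementation:

```python
import re


def enforce_stop_tokens(text: str, stop: list) -> str:
    """Cut off the text as soon as any stop sequence occurs.

    Sketch: split on the first occurrence of any stop sequence
    and keep only the prefix before it.
    """
    pattern = "|".join(map(re.escape, stop))
    return re.split(pattern, text, maxsplit=1)[0]
```

Escaping each stop sequence keeps literal characters such as `:` or `.` from being interpreted as regex metacharacters.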
Update token usage.
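The token-usage update above follows a common pattern: accumulate counts from each API response into a running total. A minimal sketch, assuming an OpenAI-style `usage` payload (the key names are assumptions):

```python
def update_token_usage(keys: set, response: dict, token_usage: dict) -> None:
    """Accumulate token counts from a response's usage payload into a total.

    Only keys present in both `keys` and the response's usage dict are summed.
    """
    usage = response.get("usage", {})
    for key in keys.intersection(usage):
        token_usage[key] = token_usage.get(key, 0) + usage[key]
```

Calling this once per completion lets a multi-prompt generate call report aggregate usage in its final result.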
Use tenacity to retry the completion call.
Use tenacity to retry the async completion call.
Update token usage.
Create the LLMResult from the choices and prompts.
List available models.
Get completions from Aviary models.
Conditionally apply a decorator.
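The conditional-decorator utility above is a generic pattern: apply a decorator only when a flag is set, otherwise return the function unchanged. A sketch (the name is illustrative):

```python
from typing import Callable


def conditional_decorator(condition: bool, decorator: Callable) -> Callable:
    """Apply `decorator` to a function only when `condition` is true."""

    def wrapper(func: Callable) -> Callable:
        return decorator(func) if condition else func

    return wrapper
```

This is useful when, for example, a retry or tracing decorator should only wrap a call in certain configurations.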
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Use tenacity to retry the completion call for streaming.
Default guardrail violation handler.
Remove the trailing slash and /api from the URL if present.
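The URL normalization described above can be sketched as follows; the function name is hypothetical, and the logic simply mirrors the docstring:

```python
def strip_api_suffix(url: str) -> str:
    """Remove a trailing slash and a trailing /api segment from a base URL."""
    url = url.rstrip("/")
    if url.endswith("/api"):
        url = url[: -len("/api")]
    return url
```

Normalizing the base URL this way lets callers pass either a bare host or a full API endpoint without producing doubled path segments later.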
Return True if the model name is a Codey model.
Return True if the model name is a Gemini model.
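Checks like the two above typically reduce to a prefix test on the model name. A sketch for the Gemini case, assuming Gemini model identifiers start with the "gemini" prefix (an assumption, not a guarantee about every model id):

```python
def is_gemini_model(model_name: str) -> bool:
    """Return True if the model name looks like a Gemini model.

    Assumes Gemini model ids use a "gemini" prefix, e.g. "gemini-pro".
    """
    return model_name is not None and model_name.startswith("gemini")
```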
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Get the notebook REPL context if running inside a Databricks notebook.
Get the default Databricks workspace hostname.
Get the default Databricks personal access token.
Check the response from the completion call.
Use tenacity to retry the completion call.
Use tenacity to retry the completion call.
Async version of stream_generate_with_retry.
Generate elements from an iterable, pairing each with a flag marking the last element.
Generate elements from an async iterable, pairing each with a flag marking the last element.
Use tenacity to retry the completion call.
Use tenacity to retry the async completion call.