Ask a question to get started

Class●Since v0.3

SelfHostedHuggingFaceLLM

SelfHostedHuggingFaceLLM(
    self,
    **kwargs: Any = {},
)

Bases

SelfHostedPipeline

Constructors

Attributes

Inherited fromSelfHostedPipeline

Attributes

Apipeline_ref: Any Aclient: Any Aload_fn_kwargs: Optional[dict]

—

Keyword arguments to pass to the model load function.

View source on GitHub

allow_dangerous_deserialization

—

Allow deserialization using pickle which can be dangerous if

Inherited fromRunnableSerializable(langchain_core)

Attributes

Aname

Methods

Mto_json Mconfigurable_fields Mconfigurable_alternatives

Inherited fromBaseModel

Attributes

Auuid

HuggingFace Pipeline API to run on self-hosted remote hardware.

Supported hardware includes auto-launched instances on AWS, GCP, Azure, and Lambda, as well as servers specified by IP address and SSH credentials (such as on-prem, or another cloud like Paperspace, Coreweave, etc.).

To use, you should have the runhouse python package installed.

Only supports text-generation, text2text-generation and summarization for now.

Example using from_model_id:

.. code-block:: python

from langchain_community.llms import SelfHostedHuggingFaceLLM import runhouse as rh gpu = rh.cluster(name="rh-a10x", instance_type="A100:1") hf = SelfHostedHuggingFaceLLM( model_id="google/flan-t5-large", task="text2text-generation", hardware=gpu )

Example passing fn that generates a pipeline (bc the pipeline is not serializable): .. code-block:: python

from langchain_community.llms import SelfHostedHuggingFaceLLM
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
import runhouse as rh

def get_pipeline():
    model_id = "gpt2"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    pipe = pipeline(
        "text-generation", model=model, tokenizer=tokenizer
    )
    return pipe
hf = SelfHostedHuggingFaceLLM(
    model_load_fn=get_pipeline, model_id="gpt2", hardware=gpu)

get_config_jsonschema

LangChain Assistant

Menu

SelfHostedHuggingFaceLLM

Bases

Constructors

Attributes

Inherited fromSelfHostedPipeline

Attributes

Methods

Inherited fromBaseLLM(langchain_core)

Attributes

Methods

Inherited fromBaseLanguageModel(langchain_core)

Attributes

Methods

Inherited fromRunnableSerializable(langchain_core)

Attributes

Methods

Inherited fromSerializable(langchain_core)

Attributes

Methods

Inherited fromRunnable(langchain_core)

Attributes

Methods

Inherited fromBaseModel

Attributes