Class (since v0.2)

DocumentIndex

A document retriever that supports indexing operations.

This indexing interface is a generic abstraction for storing and querying documents that have an ID and associated metadata.

The interface is agnostic to the underlying implementation of the indexing system.

It supports the following operations:

Storing documents in the index.
Fetching documents by ID.
Searching for documents using a query.
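
The three operations listed above can be pictured as a small contract. The following is a minimal sketch only, not the library's implementation: `Doc`, `IndexLike`, and `DictIndex` are hypothetical names, the method signatures are assumptions chosen to mirror the list, and the substring search stands in for whatever query mechanism a real backend provides.

```python
import uuid
from dataclasses import dataclass, field
from typing import Any, Optional, Protocol, Sequence, runtime_checkable


@dataclass
class Doc:
    """Hypothetical stand-in for a document with an ID and metadata."""

    page_content: str
    id: Optional[str] = None
    metadata: dict[str, Any] = field(default_factory=dict)


@runtime_checkable
class IndexLike(Protocol):
    """Hypothetical contract mirroring the three listed operations."""

    def upsert(self, items: Sequence[Doc]) -> list[str]: ...

    def get(self, ids: Sequence[str]) -> list[Doc]: ...

    def search(self, query: str) -> list[Doc]: ...


class DictIndex:
    """Toy backend: a dict keyed by document ID."""

    def __init__(self) -> None:
        self._store: dict[str, Doc] = {}

    def upsert(self, items: Sequence[Doc]) -> list[str]:
        ids = []
        for doc in items:
            # Generate an ID when the caller did not supply one.
            doc_id = doc.id if doc.id is not None else str(uuid.uuid4())
            self._store[doc_id] = Doc(doc.page_content, doc_id, doc.metadata)
            ids.append(doc_id)
        return ids

    def get(self, ids: Sequence[str]) -> list[Doc]:
        # Unknown IDs are skipped rather than raising.
        return [self._store[i] for i in dict.fromkeys(ids) if i in self._store]

    def search(self, query: str) -> list[Doc]:
        # Naive case-insensitive substring match, for illustration only.
        needle = query.lower()
        return [d for d in self._store.values() if needle in d.page_content.lower()]
```

Because the contract says nothing about storage, any backend that offers these three methods could sit behind the same calling code.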

DocumentIndex(
    self,
    *args: Any,
    **kwargs: Any,
)

Bases

BaseRetriever

Methods

method

upsert

Upsert documents into the index.

The upsert functionality should use the ID field of the document if it is provided. If the ID is not provided, the upsert method is free to generate an ID for the document.

When an ID is specified and the document already exists in the index, the upsert method should update the document with the new data. If the document does not exist, the upsert method should add it to the index.
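
The update-or-insert rule described above can be sketched with a dictionary keyed by ID. This is an illustrative toy under stated assumptions, not the library's code: `Doc` and `ToyIndex` are hypothetical names, and the returned dictionary with `succeeded` and `failed` keys is an assumed response shape.

```python
import uuid
from dataclasses import dataclass
from typing import Optional


@dataclass
class Doc:
    """Hypothetical document type with an optional ID."""

    page_content: str
    id: Optional[str] = None


class ToyIndex:
    """Hypothetical in-memory index used only to show upsert semantics."""

    def __init__(self) -> None:
        self.store: dict[str, Doc] = {}

    def upsert(self, items: list[Doc]) -> dict[str, list[str]]:
        succeeded = []
        for doc in items:
            # Use the caller's ID if present, otherwise generate one.
            doc_id = doc.id if doc.id is not None else str(uuid.uuid4())
            # An existing ID is overwritten (update); a new ID adds an entry.
            self.store[doc_id] = Doc(doc.page_content, id=doc_id)
            succeeded.append(doc_id)
        return {"succeeded": succeeded, "failed": []}


index = ToyIndex()
index.upsert([Doc("first version", id="a")])
index.upsert([Doc("second version", id="a")])  # same ID: replaces the entry
result = index.upsert([Doc("no ID supplied")])  # ID is generated
```

After these calls the index holds two entries: the updated document under `"a"` and one document under a generated ID, which is reported back in `result["succeeded"]`.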

method

aupsert

Add or update documents in the index. Async version of upsert.

The upsert functionality should use the ID field of the document if it is provided. If the ID is not provided, the upsert method is free to generate an ID for the document.

When an ID is specified and the document already exists in the index, the upsert method should update the document with the new data. If the document does not exist, the upsert method should add it to the index.
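
One common way to provide an async variant is to delegate to the synchronous method in a worker thread. The sketch below assumes that approach and does not claim to match how any particular implementation does it. `ToyIndex` is a hypothetical class, and passing items as an ID-to-text mapping is a simplification for brevity.

```python
import asyncio


class ToyIndex:
    """Hypothetical index whose async method wraps the sync one."""

    def __init__(self) -> None:
        self.store: dict[str, str] = {}

    def upsert(self, items: dict[str, str]) -> list[str]:
        # Existing IDs are updated, new IDs are added.
        self.store.update(items)
        return list(items)

    async def aupsert(self, items: dict[str, str]) -> list[str]:
        # Run the blocking call in a thread so the event loop stays free.
        return await asyncio.to_thread(self.upsert, items)


async def main() -> tuple[ToyIndex, list[str]]:
    index = ToyIndex()
    ids = await index.aupsert({"a": "first"})
    await index.aupsert({"a": "second"})  # same ID: updates in place
    return index, ids


index, ids = asyncio.run(main())
```

A backend with a native async client would more likely implement `aupsert` directly instead of going through a thread.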

method

delete

Delete by IDs or other criteria.

Calling delete without any input parameters should raise a ValueError.
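
The guard described above keeps an accidental bare call from being read as "delete everything". A minimal sketch, assuming a hypothetical `ToyIndex` with a dictionary store and an assumed `num_deleted` response key:

```python
from typing import Optional, Sequence


class ToyIndex:
    """Hypothetical index used only to show the delete guard."""

    def __init__(self, store: dict[str, str]) -> None:
        self.store = dict(store)

    def delete(self, ids: Optional[Sequence[str]] = None) -> dict[str, int]:
        if ids is None:
            # No criteria given: refuse rather than guess the caller's intent.
            raise ValueError("delete() requires IDs or other criteria")
        removed = 0
        for doc_id in ids:
            if doc_id in self.store:
                del self.store[doc_id]
                removed += 1
        return {"num_deleted": removed}
```

In this sketch, IDs that are not present are skipped silently, so the count reflects only the documents actually removed.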

method

adelete

Delete by IDs or other criteria. Async variant.

Calling adelete without any input parameters should raise a ValueError.

method

get

Get documents by ID.

Fewer documents may be returned than requested if some IDs are not found or if there are duplicated IDs.

Users should not assume that the order of the returned documents matches the order of the input IDs. Instead, users should rely on the ID field of the returned documents.

This method should NOT raise exceptions if no documents are found for some IDs.
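
Since neither the count nor the order of results is guaranteed, callers can key the results by ID instead of zipping them against the request. The sketch below is a hypothetical illustration: `get_docs` and the dictionary-based documents are stand-ins, not library APIs.

```python
from typing import Sequence


def get_docs(store: dict[str, dict], ids: Sequence[str]) -> list[dict]:
    """Return stored documents for known IDs; skip unknown and repeated IDs."""
    seen: set[str] = set()
    found = []
    for doc_id in ids:
        if doc_id in seen or doc_id not in store:
            continue  # missing or duplicated IDs do not raise
        seen.add(doc_id)
        found.append(store[doc_id])
    return found


store = {
    "a": {"id": "a", "text": "alpha"},
    "b": {"id": "b", "text": "beta"},
}

# Four IDs requested: one duplicate and one unknown.
docs = get_docs(store, ["b", "a", "b", "zzz"])

# Rely on each document's own ID rather than on positional order.
by_id = {doc["id"]: doc for doc in docs}
missing = [i for i in ["b", "a", "zzz"] if i not in by_id]
```

Building `by_id` makes missing IDs easy to detect and removes any dependence on the order in which the backend returns results.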

method

aget

Get documents by ID. Async version of get.

Fewer documents may be returned than requested if some IDs are not found or if there are duplicated IDs.

Users should not assume that the order of the returned documents matches the order of the input IDs. Instead, users should rely on the ID field of the returned documents.

This method should NOT raise exceptions if no documents are found for some IDs.

Inherited from BaseRetriever

Attributes

model_config

tags: list[str] | None

Optional list of tags associated with the retriever.

metadata: dict[str, Any] | None

Optional metadata associated with the retriever.

Methods

invoke

Invoke the retriever to get relevant documents.

ainvoke

Asynchronously invoke the retriever to get relevant documents.

Inherited from RunnableSerializable

Attributes

name: str

The name of the Runnable.

model_config

Methods

to_json

Serialize the Runnable to JSON.

configurable_fields

configurable_alternatives

Configure alternatives for Runnable objects that can be set at runtime.

Inherited from Serializable

Attributes

lc_secrets: dict[str, str]

A map of constructor argument names to secret ids.

lc_attributes: dict

List of attribute names that should be included in the serialized kwargs.

model_config

Methods

is_lc_serializable

Return True as this class is serializable.

get_lc_namespace

Get the namespace of the LangChain object.

lc_id

Return a unique identifier for this class for serialization purposes.

to_json

Serialize the object to JSON.

to_json_not_implemented

Serialize a "not implemented" object.

Inherited from Runnable

Attributes

name: str

The name of the Runnable.

InputType: Any

OutputType: Any

input_schema: type[BaseModel]

The type of input this Runnable accepts specified as a Pydantic model.

output_schema: type[BaseModel]

Output schema.

config_specs: list[ConfigurableFieldSpec]

Methods

get_name

get_input_schema

get_input_jsonschema

Get a JSON schema that represents the input to the Runnable.

get_output_schema

get_output_jsonschema

Get a JSON schema that represents the output of the Runnable.

config_schema

The type of config this Runnable accepts specified as a Pydantic model.

get_config_jsonschema

Get a JSON schema that represents the config of the Runnable.

get_graph

get_prompts

Return a list of prompts used by this Runnable.

pipe

Pipe Runnable objects.

pick

Pick keys from the output dict of this Runnable.

assign

Merge the Dict input with the output produced by the mapping argument.

invoke

Invoke the retriever to get relevant documents.

ainvoke

Asynchronously invoke the retriever to get relevant documents.

batch

batch_as_completed

Run invoke in parallel on a list of inputs.

abatch

abatch_as_completed

Run ainvoke in parallel on a list of inputs.

stream

astream

astream_log

Stream all output from a Runnable, as reported to the callback system.

astream_events

Generate a stream of events.

transform

atransform

bind

Bind arguments to a Runnable, returning a new Runnable.

with_config

with_listeners

Bind lifecycle listeners to a Runnable, returning a new Runnable.

with_alisteners

Bind async lifecycle listeners to a Runnable.

with_types

Bind input and output types to a Runnable, returning a new Runnable.

with_retry

Create a new Runnable that retries the original Runnable on exceptions.

map

Return a new Runnable that maps a list of inputs to a list of outputs.

with_fallbacks

Add fallbacks to a Runnable, returning a new Runnable.

as_tool

Create a BaseTool from a Runnable.
