A vector store stores embedded data and performs vector search.
One of the most common ways to store and search over unstructured data is to embed it and store the resulting embedding vectors, then query the store to retrieve the data 'most similar' to the embedded query.
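As a minimal illustration of this embed-and-search pattern (pure Python, with toy 3-dimensional vectors standing in for a real embedding model and store), a query vector is compared against stored vectors by cosine similarity and the most similar entry is returned:

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "store": texts with made-up pre-computed embedding vectors.
store = {
    "cats are mammals": [0.9, 0.1, 0.0],
    "stocks fell today": [0.0, 0.2, 0.9],
}

def most_similar(query_vector):
    # Return the stored text whose embedding is closest to the query.
    return max(store, key=lambda text: cosine_similarity(query_vector, store[text]))

print(most_similar([0.8, 0.2, 0.1]))
```

Real vector stores replace the linear scan with an index structure (HNSW, IVF, etc.), but the retrieval contract is the same.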
Aerospike vector store.
To use, you should have the aerospike_vector_search python package installed.
Alibaba Cloud OpenSearch vector store.
Alibaba Cloud OpenSearch client configuration.
AnalyticDB (distributed PostgreSQL) vector store.
AnalyticDB is a distributed, cloud-native database with full PostgreSQL syntax support.
- connection_string: a Postgres connection string.
- embedding_function: any embedding function implementing the langchain.embeddings.base.Embeddings interface.
- collection_name: the name of the collection to use (default: langchain).
- pre_delete_collection: if True, deletes the collection if it already exists (default: False).
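A hedged sketch of assembling the Postgres connection string that parameter lists like this one expect (the helper name, driver prefix, and all credential values below are hypothetical placeholders, not part of this API):

```python
def make_pg_connection_string(user, password, host, port, database):
    # Standard libpq-style URI; the driver prefix may vary by integration
    # (e.g. "postgresql+psycopg2" for SQLAlchemy-based stores).
    return f"postgresql+psycopg2://{user}:{password}@{host}:{port}/{database}"

conn_str = make_pg_connection_string(
    "postgres", "secret", "localhost", 5432, "langchain"
)
print(conn_str)
```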
Annoy vector store.
To use, you should have the annoy python package installed.
Apache Doris vector store.
You need the pymysql python package and a valid account
to connect to Apache Doris.
For more information, please visit the Apache Doris official site and the Apache Doris GitHub repository.
Atlas vector store.
Atlas is Nomic's neural database and rhizomatic instrument.
To use, you should have the nomic python package installed.
AwaDB vector store.
Azure Cosmos DB for MongoDB vCore vector store.
To use, you should have the pymongo python package installed.
Azure Cognitive Search vector store.
Bagel.net Inference platform.
To use, you should have the bagelML python package installed.
Baidu Elasticsearch vector store.
Baidu VectorDB as a vector store.
To use this, you need to have a database instance. See the following documentation for details: https://cloud.baidu.com/doc/VDB/index.html
Clarifai AI vector store.
To use, you should have the clarifai python SDK package installed.
ClickHouse vector store integration.
ClickHouse client configuration.
DashVector vector store.
To use, you should have the dashvector python package installed.
Dingo vector store.
To use, you should have the dingodb python package installed.
HnswLib storage using DocArray package.
To use it, you should have the docarray package with version >=0.32.0 installed.
You can install it with pip install docarray.
In-memory DocArray storage for exact search.
To use it, you should have the docarray package with version >=0.32.0 installed.
You can install it with pip install docarray.
Amazon DocumentDB (with MongoDB compatibility) vector store.
Please refer to the official Vector Search documentation for more details:
https://docs.aws.amazon.com/documentdb/latest/developerguide/vector-search.html
To use, you should have the pymongo python package installed.
DuckDB vector store.
This class provides a vector store interface for adding texts and performing similarity searches using DuckDB.
For more information about DuckDB, see: https://duckdb.org/
This integration requires the duckdb Python package.
You can install it with pip install duckdb.
Security Notice: The default DuckDB configuration is not secure.
By **default**, DuckDB can interact with files across the entire file system,
which includes abilities to read, write, and list files and directories.
It can also access some python variables present in the global namespace.
When using this DuckDB vectorstore, we suggest that you initialize the
DuckDB connection with a secure configuration.
For example, you can set `enable_external_access` to `false` in the connection
configuration to disable external access to the DuckDB connection.
You can view the DuckDB configuration options here:
https://duckdb.org/docs/configuration/overview.html
Please review other relevant security considerations in the DuckDB
documentation (e.g., setting "autoinstall_known_extensions": "false" and
"autoload_known_extensions": "false").
See https://python.langchain.com/docs/security for more information.
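A sketch of the hardened configuration described above. The option names come from the DuckDB configuration documentation; actually passing the dict to duckdb.connect assumes the duckdb package is installed, so that call is shown only in a comment:

```python
# Configuration options that restrict what a DuckDB connection can do.
secure_config = {
    "enable_external_access": "false",        # no file-system / network access
    "autoinstall_known_extensions": "false",  # no automatic extension installs
    "autoload_known_extensions": "false",     # no automatic extension loading
}

# With the duckdb package installed, the dict would be passed like:
#   import duckdb
#   con = duckdb.connect(database=":memory:", config=secure_config)
print(sorted(secure_config))
```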
ecloud Elasticsearch vector store.
Wrapper around Epsilla vector database.
As a prerequisite, you need to install the pyepsilla package
and have a running Epsilla vector database (for example, through its Docker image).
See the following documentation for how to run an Epsilla vector database:
https://epsilla-inc.gitbook.io/epsilladb/quick-start
FAISS vector store integration.
See The FAISS Library paper.
Hologres API vector store.
- connection_string: a Hologres connection string.
- embedding_function: any embedding function implementing the langchain.embeddings.base.Embeddings interface.
- ndims: the number of dimensions of the embedding output.
- table_name: the name of the table to store embeddings and data (default: langchain_pg_embedding).
- pre_delete_table: if True, deletes the table if it already exists (default: False).
Infinispan VectorStore interface.
This class exposes the method to present Infinispan as a VectorStore. It relies on the Infinispan class (below) which takes care of the REST interface with the server.
KDB.AI vector store.
See https://kdb.ai.
To use, you should have the kdbai_client python package installed.
Enumerator of the Distance strategies.
Kinetica vector store.
To use, you should have the gpudb python package installed.
Kinetica client configuration.
LanceDB vector store.
To use, you should have the lancedb python package installed.
You can install it with pip install lancedb.
Postgres with the lantern extension as a vector store.
Lantern uses a sequential scan by default, but you can create an HNSW index using the create_hnsw_index method.
- connection_string: a Postgres connection string.
- embedding_function: any embedding function implementing the langchain.embeddings.base.Embeddings interface.
- collection_name: the name of the collection to use (default: langchain).
- distance_strategy: the distance strategy to use (default: EUCLIDEAN). EUCLIDEAN is the Euclidean distance, COSINE is the cosine distance, and HAMMING is the Hamming distance.
- pre_delete_collection: if True, deletes the collection if it already exists (default: False).
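The three distance strategies can be illustrated in pure Python. This is a sketch of the metrics themselves, not of Lantern's implementation:

```python
import math

def euclidean(a, b):
    # Straight-line distance between two vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_distance(a, b):
    # 1 - cosine similarity; 0 for identical directions.
    dot = sum(x * y for x, y in zip(a, b))
    norms = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return 1 - dot / norms

def hamming(a, b):
    # Number of positions at which the entries differ (binary vectors).
    return sum(x != y for x, y in zip(a, b))

print(euclidean([0, 0], [3, 4]))      # 5.0
print(hamming([1, 0, 1], [1, 1, 1]))  # 1
```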
Implementation of Vector Store using LLMRails.
ManticoreSearch Engine vector store.
To use, you should have the manticoresearch python package installed.
Marqo vector store.
Marqo indexes have their own models associated with them to generate your embeddings. This means that you can select from a range of different models and also use CLIP models to create multimodal indexes with images and text together.
Marqo also supports more advanced queries with multiple weighted terms; see https://docs.marqo.ai/latest/#searching-using-weights-in-queries. This class can flexibly take strings or dictionaries for weighted queries in its similarity search methods.
To use, you should have the marqo python package installed; you can do this with
pip install marqo.
Meilisearch vector store.
To use this, you need to have the meilisearch python package installed,
and a running Meilisearch instance.
To learn more about Meilisearch Python, refer to the in-depth Meilisearch Python documentation: https://meilisearch.github.io/meilisearch-python/.
See the following documentation for how to run a Meilisearch instance: https://www.meilisearch.com/docs/learn/getting_started/quick_start.
Momento Vector Index (MVI) vector store.
Momento Vector Index is a serverless vector index that can be used to store and
search vectors. To use you should have the momento python package installed.
MyScale vector store.
You need the clickhouse-connect python package and a valid account
to connect to MyScale.
MyScale can not only search with simple vector indexes; it also supports complex queries with multiple conditions, constraints, and even sub-queries.
For more information, please visit the MyScale official site.
MyScale client configuration.
Amazon OpenSearch Vector Engine vector store.
VectorStore connecting to Pathway Vector Store.
Postgres with the pg_embedding extension as a vector store.
pg_embedding uses a sequential scan by default, but you can create an HNSW index using the create_hnsw_index method.
- connection_string: a Postgres connection string.
- embedding_function: any embedding function implementing the langchain.embeddings.base.Embeddings interface.
- collection_name: the name of the collection to use (default: langchain).
- distance_strategy: the distance strategy to use (default: EUCLIDEAN). EUCLIDEAN is the Euclidean distance.
- pre_delete_collection: if True, deletes the collection if it already exists (default: False).
Relyt (distributed PostgreSQL) vector store.
Relyt is a distributed, cloud-native database with full PostgreSQL syntax support.
- connection_string: a Postgres connection string.
- embedding_function: any embedding function implementing the langchain.embeddings.base.Embeddings interface.
- collection_name: the name of the collection to use (default: langchain).
- pre_delete_collection: if True, deletes the collection if it already exists (default: False).
Rockset vector store.
To use, you should have the rockset python package installed. Note that to use
this, the collection being used must already exist in your Rockset instance.
You must also ensure you use a Rockset ingest transformation to apply
VECTOR_ENFORCE on the column being used to store embedding_key in the
collection.
See https://rockset.com/blog/introducing-vector-search-on-rockset/ for more details.
Everything below assumes the commons Rockset workspace.
ScaNN vector store.
To use, you should have the scann python package installed.
SemaDB vector store.
This vector store is a wrapper around the SemaDB database.
Simple in-memory vector store based on the scikit-learn NearestNeighbors implementation.
SQLite with Vec extension as a vector database.
To use, you should have the sqlite-vec python package installed.
Example:
.. code-block:: python
from langchain_community.vectorstores import SQLiteVec
from langchain_community.embeddings.openai import OpenAIEmbeddings
...
SQLite with VSS extension as a vector database.
To use, you should have the sqlite-vss python package installed.
Example:
.. code-block:: python
from langchain_community.vectorstores import SQLiteVSS
from langchain_community.embeddings.openai import OpenAIEmbeddings
...
StarRocks vector store.
You need the pymysql python package and a valid account
to connect to StarRocks.
Right now StarRocks has only implemented the cosine_similarity function to
compute the distance between two vectors, and there is no vector index yet,
so we have to iterate over all vectors and compute the spatial distance.
For more information, please visit the StarRocks official site and the StarRocks GitHub repository.
Supabase Postgres vector store.
It assumes you have the pgvector
extension installed and a match_documents (or similar) function. For more details:
https://integrations.langchain.com/vectorstores?integration_name=SupabaseVectorStore
You can implement your own match_documents function in order to limit the search
space to a subset of documents based on your own authorization or business logic.
Note that the Supabase Python client does not yet support async operations.
If you'd like to use max_marginal_relevance_search, please review the instructions
below on modifying the match_documents function to return matched embeddings.
Examples:
.. code-block:: python
from langchain_community.embeddings.openai import OpenAIEmbeddings
from langchain_core.documents import Document
from langchain_community.vectorstores import SupabaseVectorStore
from supabase.client import create_client
docs = [
Document(page_content="foo", metadata={"id": 1}),
]
embeddings = OpenAIEmbeddings()
supabase_client = create_client("my_supabase_url", "my_supabase_key")
vector_store = SupabaseVectorStore.from_documents(
docs,
embeddings,
client=supabase_client,
table_name="documents",
query_name="match_documents",
chunk_size=500,
)
To load from an existing table:
.. code-block:: python
from langchain_community.embeddings.openai import OpenAIEmbeddings
from langchain_community.vectorstores import SupabaseVectorStore
from supabase.client import create_client
embeddings = OpenAIEmbeddings()
supabase_client = create_client("my_supabase_url", "my_supabase_key")
vector_store = SupabaseVectorStore(
client=supabase_client,
embedding=embeddings,
table_name="documents",
query_name="match_documents",
)
SurrealDB as a vector store.
To use, you should have the surrealdb python package installed.
Tablestore vector store.
To use, you should have the tablestore python package installed.
Tair vector store.
Tencent VectorDB as a vector store.
To use this, you need to have a database instance. See the following documentation for details: https://cloud.tencent.com/document/product/1709/104489
Vectorstore that uses ThirdAI's NeuralDB Enterprise Python Client for NeuralDBs.
To use, you should have the thirdai[neural_db] python package installed.
Vectorstore that uses ThirdAI's NeuralDB.
To use, you should have the thirdai[neural_db] python package installed.
TiDB Vector Store.
TileDB vector store.
To use, you should have the tiledb-vector-search python package installed.
Timescale Postgres vector store.
To use, you should have the timescale_vector python package installed.
Typesense vector store.
To use, you should have the typesense python package installed.
Upstash Vector vector store.
To use, the upstash-vector python package must be installed.
An Upstash Vector index is also required. First create a new Upstash Vector index
and copy the index_url and index_token variables. Then either pass
them through the constructor or set the environment
variables UPSTASH_VECTOR_REST_URL and UPSTASH_VECTOR_REST_TOKEN.
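A sketch of the environment-variable route described above. The URL and token values are placeholders standing in for the ones copied from your index:

```python
import os

# Placeholder values; in practice, copy these from the Upstash console.
os.environ["UPSTASH_VECTOR_REST_URL"] = "https://example-index.upstash.io"
os.environ["UPSTASH_VECTOR_REST_TOKEN"] = "example-token"

# With both variables set, the vector store can be constructed
# without passing index_url / index_token explicitly.
print(os.environ["UPSTASH_VECTOR_REST_URL"])
```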
USearch vector store.
To use, you should have the usearch python package installed.
Vald vector database.
To use, you should have the vald-client-python python package installed.
Vectara API vector store.
See https://vectara.com.
Vespa vector store.
To use, you should have the python client library pyvespa installed.
VLite is a simple and fast vector database for semantic search.
Yellowbrick as a vector database.
Example:
.. code-block:: python
from langchain_community.vectorstores import Yellowbrick
from langchain_community.embeddings.openai import OpenAIEmbeddings
...
Zep vector store.
It provides methods for adding texts or documents to the store, searching for similar documents, and deleting documents.
Search scores are calculated using cosine similarity normalized to [0, 1].
Zep vector store.
It provides methods for adding texts or documents to the store, searching for similar documents, and deleting documents.
Search scores are calculated using cosine similarity normalized to [0, 1].
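One common way to map cosine similarity, which ranges over [-1, 1], onto [0, 1] is a shift-and-scale. This is a sketch of that normalization in general, offered as an assumption rather than Zep's exact formula:

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity in [-1, 1].
    dot = sum(x * y for x, y in zip(a, b))
    norms = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return dot / norms

def normalized_score(a, b):
    # Shift and scale cosine similarity from [-1, 1] into [0, 1].
    return (cosine_similarity(a, b) + 1) / 2

print(normalized_score([1.0, 0.0], [1.0, 0.0]))   # identical vectors -> 1.0
print(normalized_score([1.0, 0.0], [-1.0, 0.0]))  # opposite vectors -> 0.0
```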
Zilliz vector store.
You need to have pymilvus installed and a
running Zilliz database.
See the following documentation for how to run a Zilliz instance: https://docs.zilliz.com/docs/create-cluster
If using the L2/IP metric, it is highly suggested to normalize your data.
Azure Cosmos DB for NoSQL vector store.
To use, you should have the azure-cosmos python package installed.
You can read more about vector search, full-text search, and hybrid search using AzureCosmosDBNoSQL here:
https://learn.microsoft.com/en-us/azure/cosmos-db/nosql/vector-search
https://learn.microsoft.com/en-us/azure/cosmos-db/gen-ai/full-text-search
https://learn.microsoft.com/en-us/azure/cosmos-db/gen-ai/hybrid-search
Google Cloud BigQuery vector store.
To use, you need the following packages installed: google-cloud-bigquery
ChromaDB vector store.
To use, you should have the chromadb python package installed.
Couchbase Vector Store vector store.
To use it, you need the couchbase library.
[DEPRECATED] Elasticsearch with k-nearest neighbor search
(k-NN) vector store.
Recommended to use ElasticsearchStore instead, which supports metadata filtering, customising the query retriever and much more!
You can read more on ElasticsearchStore: https://python.langchain.com/docs/integrations/vectorstores/elasticsearch
It creates an Elasticsearch index of text data that can be searched using k-NN search. The text data is transformed into vector embeddings using a provided embedding model, and these embeddings are stored in the Elasticsearch index.
ElasticVectorSearch uses the brute force method of searching on vectors.
Recommended to use ElasticsearchStore instead, which gives you the option to use the approximate HNSW algorithm, which performs better on large datasets.
ElasticsearchStore also supports metadata filtering, customising the query retriever and much more!
You can read more on ElasticsearchStore: https://python.langchain.com/docs/integrations/vectorstores/elasticsearch
To connect to an Elasticsearch instance that does not require
login credentials, pass the Elasticsearch URL and index name along with the
embedding object to the constructor.
Elasticsearch vector store.
Postgres/PGVector vector store.
DEPRECATED: This class is pending deprecation and will likely receive
no updates. An improved version of this class is available in
langchain_postgres as PGVector. Please use that class instead.
When migrating please keep in mind that:
* The new implementation works with psycopg3, not with psycopg2
(This implementation does not work with psycopg3).
* Filtering syntax has changed to use $ prefixed operators for JSONB
metadata fields. (New implementation only uses JSONB field for metadata)
* The new implementation made some schema changes to address issues
with the existing implementation. So you will need to re-create
your tables and re-index your data or else carry out a manual
migration.
To use, you should have the pgvector python package installed.
Redis vector database.
SingleStore DB vector store.
The prerequisite for using this class is the installation of the singlestoredb
Python package.
The SingleStoreDB vectorstore can be created by providing an embedding function and the relevant parameters for the database connection, connection pool, and optionally, the names of the table and the fields to use.
Wrapper around Vald vector database.
Pathway Vector Store client.
The Pathway Vector Server is a pipeline written in the Pathway framework which indexes all files in a given folder, embeds them, and builds a vector index. The pipeline reacts to changes in source files, automatically updating appropriate index entries.
The PathwayVectorClient implements the LangChain VectorStore interface and queries the PathwayVectorServer to retrieve up-to-date documents.
You can use the client with managed instances of Pathway Vector Store, or run your own instance as described at https://pathway.com/developers/user-guide/llm-xpack/vectorstore_pipeline/
Wrapper around the Baidu vector database.
Utility functions for working with vectors and vectorstores.
Wrapper around TileDB vector database.
Vector Store in Google Cloud BigQuery.
Module providing Infinispan as a VectorStore.
Wrapper around Epsilla vector database.
Wrapper around scikit-learn NearestNeighbors implementation.
The vector store can be persisted in json, bson or parquet format.
Wrapper around LLMRails vector database.
VectorStore wrapper around a Postgres-TimescaleVector database.
Wrapper around the Tencent vector database.