.. warning:: Beta Feature!
Cache provides an optional caching layer for LLMs.

Cache is useful for two reasons:

- It can save you money by reducing the number of API calls you make to the LLM provider, if you're often requesting the same completion multiple times.
- It can speed up your application by reducing the number of API calls you make to the LLM provider.
Cache directly competes with Memory. See documentation for Pros and Cons.
Class hierarchy:
.. code-block::

    BaseCache --> <name>Cache  # Examples: InMemoryCache, RedisCache, GPTCache
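Concretely, every cache on this page implements the small ``BaseCache`` interface from ``langchain_core``. As a minimal sketch of the hierarchy (the class name ``MyCache`` and the dict backing are illustrative; functionally this is close to what ``InMemoryCache`` provides):

.. code-block:: python

    from typing import Any, Dict, Optional, Tuple

    from langchain_core.caches import RETURN_VAL_TYPE, BaseCache


    class MyCache(BaseCache):
        """Toy dict-backed cache, keyed on the (prompt, llm_string) pair."""

        def __init__(self) -> None:
            self._store: Dict[Tuple[str, str], RETURN_VAL_TYPE] = {}

        def lookup(self, prompt: str, llm_string: str) -> Optional[RETURN_VAL_TYPE]:
            # Return the cached generations, or None on a miss.
            return self._store.get((prompt, llm_string))

        def update(self, prompt: str, llm_string: str, return_val: RETURN_VAL_TYPE) -> None:
            # Store the generations produced for this prompt/model combination.
            self._store[(prompt, llm_string)] = return_val

        def clear(self, **kwargs: Any) -> None:
            self._store.clear()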
Cosmos DB Similarity Type, as an enumerator.
Cosmos DB Vector Search Type, as an enumerator.
Enumerator of the distance strategies for calculating distances between vectors.
Azure Cosmos DB for MongoDB vCore vector store.

To use, you should have both:

- the ``pymongo`` python package installed
- a connection string associated with a MongoDB vCore cluster

Cache that stores things in memory.
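Enabling the in-memory cache is a one-liner; this is the standard pattern for every cache listed below:

.. code-block:: python

    from langchain_community.cache import InMemoryCache
    from langchain_core.globals import set_llm_cache

    # Subsequent LLM calls in this process reuse cached generations
    # for repeated (prompt, model parameters) pairs.
    set_llm_cache(InMemoryCache())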
SQLite table for full LLM Cache (all generations).
Cache that uses SQLAlchemy as a backend.
Cache that uses SQLite as a backend.
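For the SQLite-backed cache, a typical setup (the database path is an arbitrary local file):

.. code-block:: python

    from langchain_community.cache import SQLiteCache
    from langchain_core.globals import set_llm_cache

    # Unlike InMemoryCache, cached generations survive process restarts.
    set_llm_cache(SQLiteCache(database_path=".langchain.db"))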
Cache that uses Upstash Redis as a backend.
Cache that uses Redis as a backend, using a synchronous ``redis.Redis`` client.
Cache that uses Redis as a backend, using an asynchronous ``redis.asyncio.Redis`` client.
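A sketch for the synchronous Redis cache, assuming a Redis server on localhost (the async variant is constructed the same way from a ``redis.asyncio.Redis`` client):

.. code-block:: python

    from langchain_community.cache import RedisCache
    from langchain_core.globals import set_llm_cache
    from redis import Redis

    set_llm_cache(RedisCache(redis_=Redis.from_url("redis://localhost:6379")))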
Cache that uses Redis as a vector-store backend.
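The semantic Redis cache matches on embedding similarity rather than exact prompt equality; a sketch, with the embedding left as a placeholder:

.. code-block:: python

    from langchain_community.cache import RedisSemanticCache
    from langchain_core.globals import set_llm_cache

    my_embedding = ...  # any Embeddings implementation

    set_llm_cache(RedisSemanticCache(
        redis_url="redis://localhost:6379",
        embedding=my_embedding,
    ))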
Cache that uses GPTCache as a backend.
Cache that uses Momento as a backend. See https://gomomento.com/
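A sketch for Momento, assuming your Momento credentials are available to the SDK (e.g. via its auth-token environment variable):

.. code-block:: python

    from datetime import timedelta

    from langchain_community.cache import MomentoCache
    from langchain_core.globals import set_llm_cache

    # Creates the cache if needed; entries expire after the TTL.
    set_llm_cache(MomentoCache.from_client_params("langchain", ttl=timedelta(days=1)))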
Cache that uses Cassandra / Astra DB as a backend.
Example:
.. code-block:: python

    import cassio
    from langchain_community.cache import CassandraCache
    from langchain_core.globals import set_llm_cache

    cassio.init(auto=True)  # Requires env. variables, see CassIO docs

    set_llm_cache(CassandraCache())
It uses a single Cassandra table. The lookup keys (which get to form the primary key) are:

- ``prompt``, a string
- ``llm_string``, a deterministic str representation of the model parameters (needed to prevent same-prompt-different-model collisions)
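To make the key pair concrete, here is a hedged illustration of the low-level ``lookup``/``update`` interface (normally invoked by the LLM machinery, which derives ``llm_string`` internally; the strings below are made up):

.. code-block:: python

    from langchain_core.outputs import Generation

    cache = CassandraCache()  # assumes cassio.init(...) ran as above
    llm_a = "model='m1', temperature=0.0"  # hypothetical serialized params
    llm_b = "model='m1', temperature=0.7"

    cache.update("Tell me a joke", llm_a, [Generation(text="Why did...")])
    cache.lookup("Tell me a joke", llm_a)  # hit: returns the cached generations
    cache.lookup("Tell me a joke", llm_b)  # miss: returns None (different params)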
Cache that uses Cassandra as a vector-store backend for semantic (i.e. similarity-based) lookup.
Example:
.. code-block:: python

    import cassio
    from langchain_community.cache import CassandraSemanticCache
    from langchain_core.globals import set_llm_cache

    cassio.init(auto=True)  # Requires env. variables, see CassIO docs

    my_embedding = ...

    set_llm_cache(CassandraSemanticCache(
        embedding=my_embedding,
        table_name="my_semantic_cache",
    ))
It uses a single (vector) Cassandra table and stores, in principle, cached values from several LLMs, so the LLM's llm_string is part of the rows' primary keys.
One can choose a similarity measure (default: "dot" for dot-product). Choosing another one ("cos", "l2") almost certainly requires threshold tuning, which may be in order anyway, even when sticking to "dot".
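A hedged tuning sketch; the ``similarity_measure`` and ``score_threshold`` parameter names reflect recent ``langchain_community`` releases and may differ in older ones:

.. code-block:: python

    set_llm_cache(CassandraSemanticCache(
        embedding=my_embedding,
        table_name="my_semantic_cache",
        similarity_measure="cos",  # instead of the default "dot"
        score_threshold=0.9,       # tune for the chosen measure and embedding
    ))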
Cache that uses an Azure Cosmos DB for MongoDB vCore vector store as a backend.
Cache that uses an Azure Cosmos DB for NoSQL backend.
Cache that uses an OpenSearch vector store as a backend.
Cache that uses a Memcached backend through the pymemcache client library.
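A sketch for the Memcached cache, assuming a memcached server on localhost:

.. code-block:: python

    from langchain_community.cache import MemcachedCache
    from langchain_core.globals import set_llm_cache
    from pymemcache.client.base import Client

    set_llm_cache(MemcachedCache(Client("localhost")))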
Azure Cosmos DB for NoSQL vector store.
To use, you should have the ``azure-cosmos`` python package installed.

You can read more about vector search, full text search and hybrid search using AzureCosmosDBNoSQL here:

- https://learn.microsoft.com/en-us/azure/cosmos-db/nosql/vector-search
- https://learn.microsoft.com/en-us/azure/cosmos-db/gen-ai/full-text-search
- https://learn.microsoft.com/en-us/azure/cosmos-db/gen-ai/hybrid-search
SingleStore DB vector store.
The prerequisite for using this class is the installation of the ``singlestoredb`` Python package.

The SingleStoreDB vector store can be created by providing an embedding function and the relevant parameters for the database connection, the connection pool, and, optionally, the names of the table and the fields to use.
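A minimal construction sketch; the connection URL is a placeholder, and passing it through the ``SINGLESTOREDB_URL`` environment variable is one of the supported options:

.. code-block:: python

    import os

    from langchain_community.vectorstores import SingleStoreDB

    os.environ["SINGLESTOREDB_URL"] = "user:password@localhost:3306/db"  # placeholder

    my_embedding = ...  # any Embeddings implementation
    vectorstore = SingleStoreDB(my_embedding, table_name="my_documents")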
Cache that uses SingleStore DB as a backend.