AsyncElasticsearchRetriever

AsyncElasticsearchRetriever(
  self,
  index_name: Union[str, Sequence[str]],
  body_func: Callable

Bases

BaseRetriever

Constructors

Attributes

Inherited fromBaseRetriever(langchain_core)

Attributes

Amodel_config Atags Ametadata

Methods

Minvoke

View source on GitHub

Parameters

Name	Type	Description
`index_name`*	`Union[str, Sequence[str]]`	The name of the index to query. Can also be a list of names.
`body_func`*	`Callable[[str], Dict]`
`content_field`	`Optional[Union[str, Mapping[str, str]]]`	Default:`None`
`document_mapper`	`Optional[Callable[[Mapping], Document]]`	Default:`None`
`client`	`AsyncElasticsearch`	Default:`None`
`es_url`	`str`	Default:`None`
`es_cloud_id`	`str`	Default:`None`
`es_user`	`str`	Default:`None`
`es_api_key`	`str`	Default:`None`
`es_password`	`str`	Default:`None`

constructor

__init__

Name	Type
index_name	Union[str, Sequence[str]]
body_func	Callable[[str], Dict]
content_field	Optional[Union[str, Mapping[str, str]]]
document_mapper	Optional[Callable[[Mapping], Document]]
client	Optional[AsyncElasticsearch]
es_url	Optional[str]
es_cloud_id	Optional[str]
es_user	Optional[str]
es_api_key	Optional[str]
es_password	Optional[str]

Setup:

Install langchain_elasticsearch and start Elasticsearch locally using the start-local script.

pip install -qU langchain_elasticsearch
curl -fsSL https://elastic.co/start-local | sh

This will create an elastic-start-local folder. To start Elasticsearch and Kibana:

cd elastic-start-local
./start.sh

Elasticsearch will be available at http://localhost:9200. The password for the elastic user and API key are stored in the .env file in the elastic-start-local folder.

Initialize the AsyncElasticsearchRetriever instance.

Instantiate:

from langchain_elasticsearch import ElasticsearchRetriever

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    content_field="text",
    es_url="http://localhost:9200",
)

Instantiate with API key (URL):

from langchain_elasticsearch import ElasticsearchRetriever

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    content_field="text",
    es_url="http://localhost:9200",
    es_api_key="your-api-key"
)

Instantiate with username/password (URL):

from langchain_elasticsearch import ElasticsearchRetriever

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    content_field="text",
    es_url="http://localhost:9200",
    es_user="elastic",
    es_password="password"
)

If you want to use a cloud hosted Elasticsearch instance, you can pass in the es_cloud_id argument instead of the es_url argument.

Instantiate from cloud (with username/password):

from langchain_elasticsearch import ElasticsearchRetriever

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    content_field="text",
    es_cloud_id="<cloud_id>",
    es_user="elastic",
    es_password="<password>"
)

Instantiate from cloud (with API key):

from langchain_elasticsearch import ElasticsearchRetriever

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    content_field="text",
    es_cloud_id="<cloud_id>",
    es_api_key="your-api-key"
)

You can also connect to an existing Elasticsearch instance by passing in a pre-existing Elasticsearch connection via the client argument.

Instantiate from existing connection:

from langchain_elasticsearch import ElasticsearchRetriever
from elasticsearch import Elasticsearch

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

client = Elasticsearch("http://localhost:9200")
retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    content_field="text",
    client=client
)

Retrieve documents:

Note: Use invoke() or ainvoke() instead of the deprecated get_relevant_documents() or aget_relevant_documents() methods.

First, index some documents:

from elasticsearch import Elasticsearch

client = Elasticsearch("http://localhost:9200")

# Index sample documents
client.index(
    index="some-index",
    document={"text": "The quick brown fox jumps over the lazy dog"},
    id="1",
    refresh=True
)
client.index(
    index="some-index",
    document={"text": "Python is a popular programming language"},
    id="2",
    refresh=True
)
client.index(
    index="some-index",
    document={"text": "Elasticsearch is a search engine"},
    id="3",
    refresh=True
)

Then retrieve documents:

from langchain_elasticsearch import ElasticsearchRetriever

def body_func(query: str) -> dict:
    return {"query": {"match": {"text": {"query": query}}}}

retriever = ElasticsearchRetriever(
    index_name="some-index",
    body_func=body_func,
    content_field="text",
    es_url="http://localhost:9200"
)

# Retrieve documents
documents = retriever.invoke("Python")
for doc in documents:
    print(f"* {doc.page_content}")

* Python is a popular programming language

Use custom document mapper:

from langchain_elasticsearch import ElasticsearchRetriever
from langchain_core.documents import Document
from elasticsearch import Elasticsearch
from typing import Mapping, Any

def body_func(query: str) -> dict:
    return {"query": {"match": {"custom_field": {"query": query}}}}

def custom_mapper(hit: Mapping[str, Any]) -> Document:
    # Custom logic to extract content and metadata
    return Document(
        page_content=hit["_source"]["custom_field"],
        metadata={"score": hit["_score"]}
    )

client = Elasticsearch("http://localhost:9200")
retriever = ElasticsearchRetriever(
    index_name="langchain-demo",
    body_func=body_func,
    document_mapper=custom_mapper,
    client=client
)

Use with multiple indices:

from langchain_elasticsearch import ElasticsearchRetriever
from elasticsearch import Elasticsearch

def body_func(query: str) -> dict:
    return {
        "query": {
            "multi_match": {
                "query": query,
                "fields": ["text_field_1", "text_field_2"]
            }
        }
    }

client = Elasticsearch("http://localhost:9200")
retriever = ElasticsearchRetriever(
    index_name=["index1", "index2"],
    body_func=body_func,
    content_field={
        "index1": "text_field_1",
        "index2": "text_field_2"
    },
    client=client
)

Use as LangChain retriever in chains:

Note: Before running this example, ensure you have indexed documents in your Elasticsearch index. The retriever will search this index for relevant documents to use as context.

from langchain_elasticsearch import ElasticsearchRetriever
from langchain_core.runnables import RunnablePassthrough
from langchain_core.prompts import ChatPromptTemplate
from langchain_ollama import ChatOllama

# ElasticsearchRetriever is already a BaseRetriever
retriever = ElasticsearchRetriever(
    index_name="some-index",
    body_func=lambda q: {"query": {"match": {"text": {"query": q}}}},
    content_field="text",
    es_url="http://localhost:9200"
)

llm = ChatOllama(model="llama3", temperature=0)

# Create a chain that retrieves documents and then generates a response
def format_docs(docs):
    # Format documents for the prompt
    return "\n\n".join(doc.page_content for doc in docs)

system_prompt = (
    "You are an assistant for question-answering tasks. "
    "Use the following pieces of retrieved context to answer "
    "the question. If you don't know the answer, say that you "
    "don't know. Use three sentences maximum and keep the "
    "answer concise."
    "\n\n"
    "Context: {context}"
)

prompt = ChatPromptTemplate.from_messages([
    ("system", system_prompt),
    ("human", "{question}"),
])

chain = (
    {"context": retriever | format_docs, "question": RunnablePassthrough()}
    | prompt
    | llm
)

result = chain.invoke("what is the answer to this question?")

For synchronous applications, use the ElasticsearchRetriever class. For asynchronous applications, use the AsyncElasticsearchRetriever class.

LangChain Assistant

Menu

AsyncElasticsearchRetriever

Bases

Constructors

Attributes

Inherited fromBaseRetriever(langchain_core)

Attributes

Methods

Inherited fromRunnableSerializable(langchain_core)

Attributes

Methods

Inherited fromSerializable(langchain_core)

Attributes

Methods

Inherited fromRunnable(langchain_core)

Attributes

Methods

Parameters