LangChain Reference home pageLangChain ReferenceLangChain Reference
  • GitHub
  • Main Docs
Deep Agents
LangChain
LangGraph
Integrations
LangSmith
  • Overview
  • MCP Adapters
    Standard Tests
    Text Splitters
    • Overview
    • Agents
    • Callbacks
    • Chains
    • Chat models
    • Embeddings
    • Evaluation
    • Globals
    • Hub
    • Memory
    • Output parsers
    • Retrievers
    • Runnables
    • LangSmith
    • Storage
    ⌘I

    LangChain Assistant

    Ask a question to get started

    Enter to send•Shift+Enter new line

    Menu

    MCP Adapters
    Standard Tests
    Text Splitters
    OverviewAgentsCallbacksChainsChat modelsEmbeddingsEvaluationGlobalsHubMemoryOutput parsersRetrieversRunnablesLangSmithStorage
    Language
    Theme
    Pythonlangchain-classicevaluationqaeval_chainQAEvalChainevaluate
    Methodā—Since v1.0

    evaluate

    Copy
    evaluate(
      self,
      examples: Sequence[dict],
      predictions: Sequence[dict]

    Used in Docs

    • How to add evaluators to an existing experiment (Python only)
    • How to define a code evaluator
    • How to define an LLM-as-a-judge evaluator
    • How to evaluate an LLM application
    • How to evaluate with repetitions
    View source on GitHub
    ,
    question_key
    :
    str
    =
    'query'
    ,
    answer_key
    :
    str
    =
    'answer'
    ,
    prediction_key
    :
    str
    =
    'result'
    ,
    *
    ,
    callbacks
    :
    Callbacks
    =
    None
    )
    ->
    list
    [
    dict
    ]

    Evaluate question answering examples and predictions.