LangChain Reference home pageLangChain ReferenceLangChain Reference
  • GitHub
  • Main Docs
Deep Agents
LangChain
LangGraph
Integrations
LangSmith
  • Overview
  • Client
  • AsyncClient
  • Run Helpers
  • Run Trees
  • Evaluation
  • Schemas
  • Utilities
  • Wrappers
  • Anonymizer
  • Testing
  • Expect API
  • Middleware
  • Pytest Plugin
  • Deployment SDK
⌘I

LangChain Assistant

Ask a question to get started

Enter to send•Shift+Enter new line

Menu

OverviewClientAsyncClientRun HelpersRun TreesEvaluationSchemasUtilitiesWrappersAnonymizerTestingExpect APIMiddlewarePytest PluginDeployment SDK
Language
Theme
Pythonlangsmithevaluation_arunneraevaluate_existing
Function●Since v0.1

aevaluate_existing

Copy
aevaluate_existing(
  experiment: Union[str, uuid.UUID, schemas.TracerSession],
  ,
  evaluators:
View source on GitHub
Optional
[
Sequence
[
Union
[
EVALUATOR_T
,
AEVALUATOR_T
]
]
]
=
None
,
summary_evaluators
:
Optional
[
Sequence
[
SUMMARY_EVALUATOR_T
]
]
=
None
,
metadata
:
Optional
[
dict
]
=
None
,
max_concurrency
:
Optional
[
int
]
=
0
,
client
:
Optional
[
langsmith
.
Client
]
=
None
,
load_nested
:
bool
=
False
,
blocking
:
bool
=
True
)
->
AsyncExperimentResults

Parameters

NameTypeDescription
experiment*Union[str, uuid.UUID]

The identifier of the experiment to evaluate.

evaluatorsOptional[Sequence[EVALUATOR_T]]
Default:None
summary_evaluatorsOptional[Sequence[SUMMARY_EVALUATOR_T]]
Default:None
metadataOptional[dict]
Default:None
max_concurrencyint | None
Default:0
clientOptional[langsmith.Client]
Default:None
load_nestedbool
Default:False
blockingbool
Default:True

Evaluate existing experiment runs asynchronously.

Optional sequence of evaluators to use for individual run evaluation.

Optional sequence of evaluators to apply over the entire dataset.

Optional metadata to include in the evaluation results.

The maximum number of concurrent evaluations to run.

If None then no limit is set. If 0 then no concurrency.

Optional Langsmith client to use for evaluation.

Whether to load all child runs for the experiment.

Default is to only load the top-level root runs.

Whether to block until evaluation is complete.