LangChain Reference home pageLangChain ReferenceLangChain Reference
  • GitHub
  • Main Docs
Deep Agents
LangChain
LangGraph
Integrations
LangSmith
  • Overview
  • Client
  • AsyncClient
  • Run Helpers
  • Run Trees
  • Evaluation
  • Schemas
  • Utilities
  • Wrappers
  • Anonymizer
  • Testing
  • Expect API
  • Middleware
  • Pytest Plugin
  • Deployment SDK
  • RemoteGraph
⌘I

LangChain Assistant

Ask a question to get started

Enter to send•Shift+Enter new line

Menu

OverviewClientAsyncClientRun HelpersRun TreesEvaluationSchemasUtilitiesWrappersAnonymizerTestingExpect APIMiddlewarePytest PluginDeployment SDKRemoteGraph
Language
Theme
Pythonlangsmithevaluation_arunneraevaluate_existing
Function●Since v0.1

aevaluate_existing

Evaluate existing experiment runs asynchronously.

Copy
aevaluate_existing(
  experiment: Union[str, uuid.UUID, schemas.TracerSession],
  ,
  evaluators: Optional[Sequence[Union[EVALUATOR_T, AEVALUATOR_T]]] = None,
  summary_evaluators: Optional[Sequence[SUMMARY_EVALUATOR_T]] = None,
  metadata: Optional[dict] = None,
  max_concurrency: Optional[int] = 0,
  client: Optional[langsmith.Client] = None,
  load_nested: bool = False,
  blocking: bool = True
) -> AsyncExperimentResults

Parameters

NameTypeDescription
experiment*Union[str, uuid.UUID]

The identifier of the experiment to evaluate.

evaluatorsOptional[Sequence[EVALUATOR_T]]
Default:None

Optional sequence of evaluators to use for individual run evaluation.

summary_evaluatorsOptional[Sequence[SUMMARY_EVALUATOR_T]]
Default:None

Optional sequence of evaluators to apply over the entire dataset.

metadataOptional[dict]
Default:None

Optional metadata to include in the evaluation results.

max_concurrencyint | None
Default:0

The maximum number of concurrent evaluations to run.

If None then no limit is set. If 0 then no concurrency.

clientOptional[langsmith.Client]
Default:None

Optional Langsmith client to use for evaluation.

load_nestedbool
Default:False

Whether to load all child runs for the experiment.

Default is to only load the top-level root runs.

blockingbool
Default:True

Whether to block until evaluation is complete.

View source on GitHub