Ask a question to get started
Enter to send•Shift+Enter new line
Evaluate a trajectory.
evaluate_agent_trajectory( self, *, prediction: str, agent_trajectory: Sequence[tuple[AgentAction, str]], input: str, reference: str | None = None, **kwargs: Any = {} ) -> dict
prediction
str
The final predicted response.
agent_trajectory
Sequence[tuple[AgentAction, str]]
The intermediate steps forming the agent trajectory.
input
The input to the agent.
reference
str | None
None
The reference answer.
**kwargs
Any
{}
Additional keyword arguments.