Header menu logo Nao

Nao.Eval Namespace

Type/Module Description

EvalCase (Module)

EvalCase (Type)

A single evaluation test case

EvalDataset (Module)

EvalDataset (Type)

A dataset is a named collection of eval cases

EvalReport (Module)

EvalReport (Type)

Aggregate report of an evaluation run

EvalResult (Module)

EvalResult (Type)

The result of evaluating a single case

EvalRunner

The evaluation runner: runs cases against an agent and scores them

EvalRunnerConfig

Configuration for the evaluation runner

EvalVerdict

The verdict of a single evaluation

IEvaluator

Interface for evaluating agent outputs against expectations

TagSummary

Summary statistics for a specific tag

Type something to start searching.