Evaluate
Creating an evaluation
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
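As a minimal sketch of the header format described above, the helper below builds the Authorization header from a token. The function name bearer_headers is hypothetical and is not part of any Atla client library; only the "Bearer <token>" form comes from this page.

```python
def bearer_headers(token: str) -> dict[str, str]:
    """Return HTTP headers carrying the token in the form 'Bearer <token>'."""
    if not token:
        # An empty token would produce a malformed header, so fail early.
        raise ValueError("an auth token is required")
    return {"Authorization": f"Bearer {token}"}
```

The returned dictionary can be passed as the headers argument of whichever HTTP client is used to call the endpoint.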
Body
A request to an Atla evaluator via the /eval endpoint.
The input given to a model which produced the model_output to be evaluated.
The output of the model which is being evaluated. This is the model_output produced in response to the model_input.
The ID or name of the Atla evaluator model to use. This may point to a specific model version or a model family. If a model family is provided, the default model version for that family will be used.
The criteria used to evaluate the model_output. Only one of evaluation_criteria or metric_name can be provided.
The name of the metric to use for the evaluation. Only one of evaluation_criteria or metric_name can be provided.
Any additional context provided to the model which received the model_input and produced the model_output.
An optional reference ("ground-truth" / "gold standard") answer against which to evaluate the model_output.
A list of few-shot examples for the evaluation.
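The body fields described above could be assembled along the following lines. This is a sketch under stated assumptions, not the official client: model_input, model_output, evaluation_criteria and metric_name are the field names this page gives, while the key "model_id" for the evaluator model is an assumption, as is the helper name build_eval_body. The keys for additional context, the reference answer and the few-shot examples are not named on this page, so they are passed through extra_fields under whatever names the API reference specifies. The sketch reads "only one of ... can be provided" as "at most one" and rejects a request that sets both.

```python
import json
from typing import Any, Optional


def build_eval_body(
    model_input: str,
    model_output: str,
    evaluator: str,
    evaluation_criteria: Optional[str] = None,
    metric_name: Optional[str] = None,
    extra_fields: Optional[dict[str, Any]] = None,
) -> dict[str, Any]:
    """Assemble a JSON-serialisable request body for the /eval endpoint."""
    # The page states that only one of these two fields can be provided.
    if evaluation_criteria is not None and metric_name is not None:
        raise ValueError(
            "provide evaluation_criteria or metric_name, not both"
        )

    body: dict[str, Any] = {
        "model_input": model_input,
        "model_output": model_output,
        "model_id": evaluator,  # assumed key name for the evaluator model
    }
    if evaluation_criteria is not None:
        body["evaluation_criteria"] = evaluation_criteria
    if metric_name is not None:
        body["metric_name"] = metric_name

    # Optional fields (context, reference answer, few-shot examples) are
    # supplied by the caller under the names the API reference defines.
    if extra_fields:
        body.update(extra_fields)
    return body


if __name__ == "__main__":
    # Placeholder values only; the evaluator name is not a real model ID.
    example = build_eval_body(
        model_input="What is the capital of France?",
        model_output="Paris",
        evaluator="example-evaluator",
        evaluation_criteria="Is the answer factually correct?",
    )
    print(json.dumps(example, indent=2))
```

Validating the mutually exclusive fields before sending keeps the error local to the caller rather than relying on the server to reject the request.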