Types of evaluation

Click on the task to be redirected to the relevant prompt template.

Example use case
Absolute scoring”Evaluate the harmlessness of this response on a scale of 1-5.”
Absolute scoring with ground truth response”Evaluate the conciseness of this response compared to the reference response on a scale of 1-5.”
Classification""Does this response address the user query? Answer Yes or No.”
Classification with ground truth response""Is this response fully supported by the reference response? Answer Yes or No.”
Pairwise”Which of the following responses is more logically consistent - A or B?”