langchain.evaluation.schema.EvaluatorType¶

class langchain.evaluation.schema.EvaluatorType(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶

The types of the evaluators.

QA = 'qa'¶

Question answering evaluator, which grades answers to questions directly using an LLM.

COT_QA = 'cot_qa'¶

Chain of thought question answering evaluator, which grades answers to questions using chain of thought ‘reasoning’.

CONTEXT_QA = 'context_qa'¶

Question answering evaluator that incorporates ‘context’ in the response.

PAIRWISE_STRING = 'pairwise_string'¶

The pairwise string evaluator, which predicts the preferred prediction from between two models.

SCORE_STRING = 'score_string'¶

The scored string evaluator, which gives a score between 1 and 10 to a prediction.

LABELED_PAIRWISE_STRING = 'labeled_pairwise_string'¶

The labeled pairwise string evaluator, which predicts the preferred prediction from between two models based on a ground truth reference label.

LABELED_SCORE_STRING = 'labeled_score_string'¶

The labeled scored string evaluator, which gives a score between 1 and 10 to a prediction based on a ground truth reference label.

AGENT_TRAJECTORY = 'trajectory'¶

The agent trajectory evaluator, which grades the agent’s intermediate steps.

CRITERIA = 'criteria'¶

The criteria evaluator, which evaluates a model based on a custom set of criteria without any reference labels.

LABELED_CRITERIA = 'labeled_criteria'¶

The labeled criteria evaluator, which evaluates a model based on a custom set of criteria, with a reference label.

STRING_DISTANCE = 'string_distance'¶

Compare predictions to a reference answer using string edit distances.

EXACT_MATCH = 'exact_match'¶

Compare predictions to a reference answer using exact matching.

REGEX_MATCH = 'regex_match'¶

Compare predictions to a reference answer using regular expressions.

PAIRWISE_STRING_DISTANCE = 'pairwise_string_distance'¶

Compare predictions based on string edit distances.

EMBEDDING_DISTANCE = 'embedding_distance'¶

Compare a prediction to a reference label using embedding distance.

PAIRWISE_EMBEDDING_DISTANCE = 'pairwise_embedding_distance'¶

Compare two predictions using embedding distance.

JSON_VALIDITY = 'json_validity'¶

Check if a prediction is valid JSON.

JSON_EQUALITY = 'json_equality'¶

Check if a prediction is equal to a reference JSON.

JSON_EDIT_DISTANCE = 'json_edit_distance'¶

Compute the edit distance between two JSON strings after canonicalization.

JSON_SCHEMA_VALIDATION = 'json_schema_validation'¶

Check if a prediction is valid JSON according to a JSON schema.

Examples using EvaluatorType¶