Toggle Menu

`langchain.evaluation.schema`.EvaluatorType¶

class langchain.evaluation.schema.EvaluatorType(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶

The types of the evaluators.

QA = 'qa'¶: Question answering evaluator, which grades answers to questions directly using an LLM.

COT_QA = 'cot_qa'¶: Chain of thought question answering evaluator, which grades answers to questions using chain of thought ‘reasoning’.

CONTEXT_QA = 'context_qa'¶: Question answering evaluator that incorporates ‘context’ in the response.

PAIRWISE_STRING = 'pairwise_string'¶: The pairwise string evaluator, which predicts the preferred prediction from between two models.

SCORE_STRING = 'score_string'¶: The scored string evaluator, which gives a score between 1 and 10 to a prediction.

LABELED_PAIRWISE_STRING = 'labeled_pairwise_string'¶: The labeled pairwise string evaluator, which predicts the preferred prediction from between two models based on a ground truth reference label.

LABELED_SCORE_STRING = 'labeled_score_string'¶: The labeled scored string evaluator, which gives a score between 1 and 10 to a prediction based on a ground truth reference label.

AGENT_TRAJECTORY = 'trajectory'¶: The agent trajectory evaluator, which grades the agent’s intermediate steps.

CRITERIA = 'criteria'¶: The criteria evaluator, which evaluates a model based on a custom set of criteria without any reference labels.

LABELED_CRITERIA = 'labeled_criteria'¶: The labeled criteria evaluator, which evaluates a model based on a custom set of criteria, with a reference label.

STRING_DISTANCE = 'string_distance'¶: Compare predictions to a reference answer using string edit distances.

EXACT_MATCH = 'exact_match'¶: Compare predictions to a reference answer using exact matching.

REGEX_MATCH = 'regex_match'¶: Compare predictions to a reference answer using regular expressions.

PAIRWISE_STRING_DISTANCE = 'pairwise_string_distance'¶: Compare predictions based on string edit distances.

EMBEDDING_DISTANCE = 'embedding_distance'¶: Compare a prediction to a reference label using embedding distance.

PAIRWISE_EMBEDDING_DISTANCE = 'pairwise_embedding_distance'¶: Compare two predictions using embedding distance.

JSON_VALIDITY = 'json_validity'¶: Check if a prediction is valid JSON.

JSON_EQUALITY = 'json_equality'¶: Check if a prediction is equal to a reference JSON.

JSON_EDIT_DISTANCE = 'json_edit_distance'¶: Compute the edit distance between two JSON strings after canonicalization.

JSON_SCHEMA_VALIDATION = 'json_schema_validation'¶: Check if a prediction is valid JSON according to a JSON schema.

Examples using EvaluatorType¶