langchain.evaluation.exact_match.base
.ExactMatchStringEvaluatorΒΆ
- class langchain.evaluation.exact_match.base.ExactMatchStringEvaluator(*, ignore_case: bool = False, ignore_punctuation: bool = False, ignore_numbers: bool = False, **kwargs: Any)[source]ΒΆ
Compute an exact match between the prediction and the reference.
Examples
>>> evaluator = ExactMatchChain() >>> evaluator.evaluate_strings( prediction="Mindy is the CTO", reference="Mindy is the CTO", ) # This will return {'score': 1.0}
>>> evaluator.evaluate_strings( prediction="Mindy is the CTO", reference="Mindy is the CEO", ) # This will return {'score': 0.0}
Attributes
evaluation_name
Get the evaluation name.
input_keys
Get the input keys.
requires_input
This evaluator does not require input.
requires_reference
This evaluator requires a reference.
Methods
__init__
(*[, ignore_case, ...])aevaluate_strings
(*, prediction[, ...])Asynchronously evaluate Chain or LLM output, based on optional input and label.
evaluate_strings
(*, prediction[, reference, ...])Evaluate Chain or LLM output, based on optional input and label.
- __init__(*, ignore_case: bool = False, ignore_punctuation: bool = False, ignore_numbers: bool = False, **kwargs: Any)[source]ΒΆ
- async aevaluate_strings(*, prediction: str, reference: Optional[str] = None, input: Optional[str] = None, **kwargs: Any) dict ΒΆ
Asynchronously evaluate Chain or LLM output, based on optional input and label.
- Parameters
prediction (str) β The LLM or chain prediction to evaluate.
reference (Optional[str], optional) β The reference label to evaluate against.
input (Optional[str], optional) β The input to consider during evaluation.
**kwargs β Additional keyword arguments, including callbacks, tags, etc.
- Returns
The evaluation results containing the score or value.
- Return type
dict
- evaluate_strings(*, prediction: str, reference: Optional[str] = None, input: Optional[str] = None, **kwargs: Any) dict ΒΆ
Evaluate Chain or LLM output, based on optional input and label.
- Parameters
prediction (str) β The LLM or chain prediction to evaluate.
reference (Optional[str], optional) β The reference label to evaluate against.
input (Optional[str], optional) β The input to consider during evaluation.
**kwargs β Additional keyword arguments, including callbacks, tags, etc.
- Returns
The evaluation results containing the score or value.
- Return type
dict