Eval Tests – Nextra

evaluation.eval_tests

`LLMPipelineTest`

class LLMPipelineTest(
	name: str
	pipeline: LLMPipeline
	metrics: SingleResultMetric[list]
)

This class provides a pipeline test for LLM pipelines that return a single result

Methods

`init`

def __init__(
	name: str
	pipeline: LLMPipeline
	metrics: SingleResultMetric[list]
)

Initialises the LLMPipelineTest class

Parameters

name: str Name given to the test pipeline: LLMPipeline The pipeline used to generate output metrics: list[SingleResultMetric] A list of metrics used to compare the pipeline output with the expected output

`run_pipeline`

def run_pipeline(
	input_data
)

Runs the provided pipeline on the input_data

Parameters

input_data The data used for input to the pipeline

Returns

str The reply from the pipeline

`evaluate`

def evaluate(
	input_data
	expected_output
)

Evaluates the attached pipeline’s output against the expected output using the metrics

Parameters

input_data The data used for input to the pipeline

expected_output The expected result of running the input data through the pipeline

Returns

Dict

A dictionary of results from evaluating the pipeline.

`drop_pipeline`

def drop_pipeline()

Eval Data Loaders Evaltypes