Skip to main content
These are pre-built evaluation templates that are easy to run without setup. They are great for a start: you can create a custom setup later.
Note that Presets apply on the dataset level. If you looking at row-level evaluations (e.g. scoring relevance, correcteness, etc. for LLM outputs and RAG), it’s best to explore built-in descriptors.

Text Evals

Evals for text and LLMs.

Data Drift

Data distribution drift detection.

Data Summary

Dataset overview and statistics .

Classification

Quality for classification tasks.

Regression

Quality for regression tasks.