Note that Presets apply on the dataset level. If you looking at row-level evaluations (e.g. scoring relevance, correcteness, etc. for LLM outputs and RAG), it’s best to explore built-in descriptors.
Text Evals
Evals for text and LLMs.
Data Drift
Data distribution drift detection.
Data Summary
Dataset overview and statistics .
Classification
Quality for classification tasks.
Regression
Quality for regression tasks.