Evaluation Agent

The Evaluation Agent provides two key capabilities:

Recommends and applies the proper evaluation metrics.
Checks whether your optimized (“smashed”) model is compatible with your inference pipeline.

No worries — you can describe your evaluation goal as a string request (e.g. "image_generation_quality"). Then, provide:

The Optimization Agent will automatically select the appropriate metric and run the evaluation for you.


task = Task(
    request="image_generation_quality",
    datamodule=PrunaDataModule.from_string('LAION256'),
    device="cpu"

This feature is ideal for teams that:

It’s designed to simplify model validation even for non-experts.

For more information, please read the documentation.