Combination Engine

The Combination Engine is the heart of Pruna AI’s optimization framework. Forget manual trial-and-error or juggling a patchwork of tools. Pruna gives you a unified library of 50+ best-in-class algorithms, organized into ten powerful methods, all designed to squeeze maximum efficiency from your models.

Capture d’écran 2025-07-30 à 15.23.18.png

To give you an example, one of the base Smash_Config for the Diffusers model is combining 5 algorithms! Disclaimer: the "best" config always depends on your use case ;)


smash_config = SmashConfig()
smash_config["compiler"] = "torch_compile"
smash_config["torch_compile_target"] = "module_list"
smash_config["quantizer"] = "fp8"
smash_config["factorizer"] = "qkv_diffusers"
smash_config["cacher"] = "auto"
smash_config["auto_cache_mode"] = "midpoint"
smash_config["auto_objective"] = "quality"
smash_config["auto_speed_factor"] = 0.46
smash_config["pruner"] = "padding_pruning"
smash_config["padding_pruning_min_sequence_length"] = 128
smash_config._prepare_saving = False
smash_config.add_tokenizer(pipe_t2i.tokenizer_2)

While Pruna is open-source at its core, as a customer you get exclusive access to all state-of-the-art techniques, proprietary methods, and every future release, ensuring your models always stay ahead of the curve.