Ablation studies — ModelBreeder.com

Why ablation matters

A model-breeding system has many moving parts. Without ablation, a successful run does not tell you what caused the improvement. Was it the mutation operator, the benchmark, the router, the archive, the release policy, or luck?

Components to ablate

Component	Turn it off by	Question answered
Mutation operator	remove one operator family	which operator produces useful descendants?
Quality-diversity archive	select only top aggregate score	does diversity help after drift?
Router learning	use static routing	does learning improve selection?
Judge gate	use direct specialist output	does judging improve quality enough to repay cost?
No-op threshold	promote all positive scores	does conservative stasis reduce regressions?
Human review	compare automated-only to reviewed release	which decisions need human judgment?

Ablation runner

pseudocode

FUNCTION run_ablation(system_config, ablations, benchmark)
    control <- RUN_EXPERIMENT(system_config, benchmark)
    rows <- []

    FOR ablation IN ablations
        modified <- APPLY_ABLATION(system_config, ablation)
        result <- RUN_EXPERIMENT(modified, benchmark)
        rows.APPEND(COMPARE(control, result, ablation))
    END FOR

    RETURN rows
END FUNCTION

Use paired evaluation

Use identical task streams, hardware, seeds, and policy thresholds where possible. If the task stream changes between runs, an apparent improvement may be noise.

Negative results are valuable

Ablations often show that a fashionable mechanism does not repay its cost. Preserve those results. They protect the ecology from repeating expensive experiments and help tune the viability weights.

Reporting standard

Each ablation report should include the removed component, expected effect, observed effect, confidence, cost delta, failure slices, and whether the result changes release policy.

Source reports used for this guide

These reports are preserved verbatim in the site archive. The guide above is an editorial synthesis and may narrow, qualify, or reorganize claims from the source material.

Evolutionary AIDesigning the “Perfect” Evolutionary AI SystemEstablished and emerging methods · 46.5 KB Evolutionary AIThe Architecture of the Perfect Evolutionary Artificial IntelligenceMixed maturity · 58.7 KB Core synthesisThe Four Fs of AI: Code Breeding, Model Breeding, and the Teleodynamic Convergence of Mutable Small-Model EcologiesConceptual synthesis · 80.5 KB

Why ablation matters

Components to ablate

Ablation runner

Use paired evaluation

Negative results are valuable

Reporting standard

Source reports used for this guide

Related guides

Evolutionary operators catalog

Routing experiments

Lineage experiments

Evolution lab