Direct answer
Model breeding is the governed creation of descendant models from existing model assets through bounded variation, recombination, specialization, compression, and selection. A descendant must have recorded parentage, declared mutation operators, reproducible evaluation evidence, and a reversible release path.
The word breeding is useful because it communicates parent models, descendants, variation, selection, and lineage. It becomes misleading when it implies that models are organisms, possess intrinsic motives, or should reproduce autonomously. In engineering terms, breeding is a controlled pipeline operated by a separate policy and evaluation system.
Operations inside the term
| Operation | Typical descendant | Main compatibility requirement | Primary risk |
|---|---|---|---|
| Fine-tuning | Domain or task specialist | Base model and training stack | Forgetting, overfitting, data leakage |
| Adapter training | Small capability overlay | Compatible base and adapter format | Adapter interference, hidden provenance |
| Distillation | Smaller student model | Teacher outputs and target task | Teacher error inheritance |
| Quantization | Lower-precision deployment variant | Runtime and operator support | Accuracy or calibration regression |
| Pruning | Sparse or reduced model | Supported architecture | Loss of rare capabilities |
| Weight merging | Hybrid parameter artifact | Strong architecture and tokenizer compatibility | Destructive interference |
| Ensemble composition | Behavioral coalition | Shared contract, not shared weights | Latency, disagreement, correlated errors |
| Router specialization | New selection policy | Stable capability metadata | Feedback loops and starvation |
| Architecture search | New topology or module graph | Reproducible representation and evaluator | Search cost, evaluator exploitation |
What is outside the definition
Model breeding does not automatically include production deployment, code mutation, permission expansion, network access, or changes to the fitness function. Those are separate authority domains. A model may suggest a descendant or training recipe, but the control plane decides whether that proposal is admissible.
The descendant contract
Every descendant should be represented as an immutable package with at least:
- a unique artifact identifier and cryptographic digest;
- one or more parent identifiers;
- the exact mutation or recombination operator;
- base architecture, tokenizer, runtime, quantization, and adapter compatibility;
- training data lineage and license constraints;
- evaluation suite version and complete scorecard;
- resource profile, risk tier, approval record, and rollback target;
- lifecycle state: candidate, shadow, canary, champion, archived, or retired.
FUNCTION create_descendant(parent_ids, operator, operator_config, policy)
REQUIRE policy.allows(operator)
REQUIRE ALL parents_are_signed(parent_ids)
REQUIRE operator_config WITHIN policy.mutation_budget
candidate <- EXECUTE_IN_SANDBOX(operator, parent_ids, operator_config)
candidate.id <- CONTENT_HASH(candidate.artifacts)
candidate.lineage <- {
parents: parent_ids,
operator: operator,
config_hash: HASH(operator_config),
created_at_utc: NOW_UTC()
}
WRITE_IMMUTABLE_PACKAGE(candidate)
RETURN candidate
END FUNCTIONSelection changes the unit of optimization
A monolithic training process optimizes one parameter store. A breeding system optimizes a portfolio: which specialists exist, which are loaded, which cooperate, and which are retired under current constraints. The best individual model may not produce the best system. A slightly weaker model that is faster, more calibrated, easier to isolate, and complementary to other specialists can have greater ecological value.
A conservative production interpretation
The strongest initial use case is not unrestricted online evolution. It is an offline champion–challenger program that generates candidate adapters, distilled specialists, or router policies; evaluates them against fixed suites; and promotes them through normal software-release controls. This captures most of the value while preserving auditability.
Source reports used for this guide
These reports are preserved verbatim in the site archive. The guide above is an editorial synthesis and may narrow, qualify, or reorganize claims from the source material.