Objective
Explore whether compatible parents can produce a smaller or stronger hybrid without losing important capabilities. The lab never loads arbitrary third-party artifacts into production and never treats a successful merge load as evidence of quality.
Parent eligibility
- verified source, digest, license, and data restrictions;
- compatible base family, tokenizer, layer shapes, and normalization;
- current evaluation cards for each parent;
- no unresolved critical safety or provenance issue;
- explicit reason the capabilities are complementary.
Search process
PROCEDURE merge_search(parent_a, parent_b, merge_space, budget)
ASSERT MERGE_ELIGIBILITY(parent_a, parent_b)
proposals <- SAMPLE_OR_OPTIMIZE_COEFFICIENTS(merge_space, budget)
candidates <- []
FOR each proposal IN proposals
artifact <- MERGE_IN_ISOLATION(parent_a, parent_b, proposal)
IF CHEAP_VALIDITY_GATES_PASS(artifact)
APPEND candidates, PACKAGE_WITH_LINEAGE(artifact, proposal)
END IF
END FOR
evidence <- EVALUATE_ALL(
candidates,
baselines = [parent_a, parent_b, OUTPUT_ENSEMBLE(parent_a, parent_b)]
)
RETURN SELECT_PARETO_FRONT(evidence)
END PROCEDURETest matrix
Evaluate both parent niches, conflicting tasks, rare capabilities, calibration, safety, long context, tool behavior if applicable, latency, memory, and quantization compatibility. Use newly created hidden cases to detect overfitting to public merge benchmarks.
Outcomes
A merge may become a candidate, remain an archived research artifact, or be rejected. The output ensemble is a legitimate winner if it preserves quality despite higher inference cost. Distillation is another alternative when a merge is unstable.
Safety
Use safe model formats, no arbitrary package code, disposable workers, network denied, artifact scanning, and strict storage quotas. Parent licenses and data restrictions combine conservatively.
Production path
Any promising merge enters the ordinary package, evaluation, shadow, and canary pipeline. The merge lab cannot change production aliases.
Source reports used for this guide
These reports are preserved verbatim in the site archive. The guide above is an editorial synthesis and may narrow, qualify, or reorganize claims from the source material.