Theory Advanced 2 minute read Updated 2026-06-29 UTC

Model Merging Operators

A positive reference for SLERP, task arithmetic, TIES, DARE, WIDEN, adapter merges, and layer recipes as model-breeding operators.

Research statusTechnical synthesis Publication statePublished Reviewed byMichael Kappel Source reports3

Operators are reusable capability moves

A merging operator is a way to turn parent artifacts into a candidate descendant. Operators should be described plainly, tested under equal budgets, and preserved in lineage.

OperatorPositive purposeBest use
Linear averagingBlend compatible parent weights.Shared-base fine-tunes with similar behavior.
SLERPSmooth spherical blending through weight space.Parent models where geodesic interpolation is more stable than a straight line.
Task arithmeticAdd useful capability deltas to a base.Combining fine-tuned task vectors.
TIESPreserve strong shared directions while reducing interference.Multi-task merges with conflicting deltas.
DARESparse capability transfer through pruning and rescaling.Adapter or task-vector combinations where most deltas are not needed.
WIDENSeparate magnitude and direction signals.More controlled exploration of merge geometry.
Adapter mergeCombine compact LoRA or adapter deltas.Frugal skill transfer without storing many full models.
Layer recipeSelect or blend layers by role.Structural recombination when different layers hold different strengths.

Compatibility first

Merging is strongest when base family, architecture, tokenizer, tensor schema, and license constraints are compatible. When compatibility is weak, use distillation or routing instead of direct weight mixing.

pseudocode
FUNCTION choose_merge_operator(parents, goal)
    IF SAME_BASE_AND_TOKENIZER(parents) AND goal == "combine small task deltas"
        RETURN ADAPTER_MERGE_OR_TASK_ARITHMETIC
    END IF
    IF COMPATIBLE_FULL_WEIGHTS(parents) AND goal == "smooth blend"
        RETURN SLERP
    END IF
    IF DELTAS_CONFLICT(parents)
        RETURN TIES_OR_DARE
    END IF
    RETURN ROUTE_OR_DISTILL_INSTEAD
END FUNCTION

The operator catalog makes experiments easier to repeat and compare.

Source reports used for this guide

These reports are preserved verbatim in the site archive. The guide above is an editorial synthesis and may narrow, qualify, or reorganize claims from the source material.