Safety Intermediate 1 minute read Updated 2026-06-26 UTC

Safety and governance

The invariants, threat models, containment, autonomy protections, and research boundaries required for adaptive model systems.

Research statusEditorial synthesis Publication statePublished Reviewed byMichael Kappel Source reports3

Adaptive does not mean unconstrained

A system can generate and select model descendants without giving any model a goal of self-preservation, unrestricted replication, privilege expansion, or authority over its evaluator. Safety comes from architecture and governance: external policy, bounded operators, isolation, independent evidence, reversible release, and human control.

Safety guides

Safety posture

The site uses a conservative default: candidates are untrusted, permissions are denied unless required, no-op is valid, production changes are progressive, and human operators can stop or reverse the process without model consent.

New safety additions

The safety section now includes Autonomy boundaries, Anti-dependency design, and Evidence ladder for public claims. These guides make clear which adaptive behaviors are useful, which require approval, and which should remain forbidden.

Source reports used for this guide

These reports are preserved verbatim in the site archive. The guide above is an editorial synthesis and may narrow, qualify, or reorganize claims from the source material.