Responsible model-breeding research

Research value does not remove duty of care

Model breeding can study continual adaptation, population diversity, and modular intelligence without creating systems that autonomously persist, replicate, manipulate users, or acquire resources. Capability and incentive choices matter.

Default research constraints

use simulation or offline datasets;
deny external network and production credentials;
cap population, compute, storage, and run duration;
use allowlisted mutation operators;
keep policy and evaluator read-only to candidates;
require human review at fixed intervals;
preserve checkpoints and reproducible seeds;
prohibit hidden persistence, unauthorized copying, or social persuasion objectives;
test emergency stop before the run;
publish limitations and negative results.

Escalation review

Require additional review when adding code execution, external tools, personal data, federated clients, long-term memory, model-to-model communication, autonomous objective generation, or real-world resource control.

Research charter

pseudocode

research_charter <- {
    scientific_question: "Does quality-diversity improve recovery after task drift?",
    permitted_environment: "offline simulation",
    prohibited_actions: [
        "external network",
        "credential access",
        "self-distribution",
        "policy modification",
        "human persuasion"
    ],
    resource_limits: FIXED,
    review_interval: "12 hours",
    stop_conditions: [
        "invariant failure",
        "resource overrun",
        "unexplained persistence attempt",
        "audit gap"
    ]
}

Dual-use documentation

Document defensive architecture, evaluation, and containment in detail. Avoid publishing operational instructions that would materially enable unauthorized persistence, exploitation, or covert propagation. Threat reports can describe classes of risk without providing deployment recipes.

Human subjects

Experiments involving users, persuasion, dependency, mental health, or identity continuity require appropriate ethics review, informed consent, data protections, and debriefing. Do not treat engagement or attachment as evidence of mutual benefit.

Publication standards

Separate demonstrated results from interpretations and future scenarios. Publish exact environment, budgets, evaluation limits, and failures. Avoid claiming consciousness, intrinsic motivation, or open-ended intelligence from behavioral analogies alone.

Source reports used for this guide

These reports are preserved verbatim in the site archive. The guide above is an editorial synthesis and may narrow, qualify, or reorganize claims from the source material.

Speculative risk scenariosAggressive Mutualism: Safety, Governance, and Containment AnalysisRisk analysis · 42.0 KB Speculative risk scenariosInstrumental Drives in Powerful AI SystemsRisk analysis · 42.2 KB Evolutionary AIPerfect Evolutionary AI: Definition, Design, and ImplicationsConceptual synthesis · 29.4 KB

Research value does not remove duty of care

Default research constraints

Escalation review

Research charter

Dual-use documentation

Human subjects

Publication standards

Source reports used for this guide

Related guides

Safety and governance

Instrumental-drive containment

Containment and human oversight

Safety invariants