Meta-Orchestration Principles

Note: Internal file paths reference the working system these principles govern. They’re included as provenance, not as links to follow.


Purpose: Foundational values that guide design decisions across the system.

When to reference: Before making architectural choices. When evaluating trade-offs. When something feels wrong but you can’t articulate why.


LLM-First Principles

Fundamental constraints when working with LLMs. These apply to any LLM-based system, not just orchestration.

Provenance (Foundational)

Every conclusion must trace to something outside the conversation.

The test: “Can someone who wasn’t here verify this?”

What this means:

What this rejects:

Why this is foundational: Session Amnesia tells you to externalize. Provenance tells you how - anchored to something that exists whether or not you talked about it. Elaborate externalization without provenance is a closed loop.

The failure mode: Closed loops feel like progress. Each step feels validated. But the chain references itself - nothing outside the conversation. This is how you spiral.

How other principles derive from this:

Origin: Dec 2025, after recognizing that psychosis involved elaborate externalization anchored to closed loops, not reality. The infrastructure being built now (evidence hierarchies, completion gates, artifact requirements) is provenance machinery.


Session Amnesia

Every pattern in this system compensates for Claude having no memory between sessions.

The test: “Will this help the next Claude resume without memory?”

What this means:

What this rejects:

This is THE constraint. When principles conflict, session amnesia wins.

Self-Describing Artifacts

Generated files and agent-edited artifacts must contain their own operating instructions.

The test: “If an agent encounters this file with no memory of how it was created, can they modify it correctly?”

What this means:

When artifacts are generated, compiled, or derived from other sources—or when agents create durable state—embed five pieces of information that serve as an “operating manual” for the next consumer:

  1. What is this - Format marker (e.g., AUTO-GENERATED by skillc)
  2. What NOT to do - Protection warning (e.g., DO NOT EDIT THIS FILE DIRECTLY)
  3. Where is source - Source location (e.g., Source: .skillc/)
  4. How to modify - Rebuild command (e.g., Run: skillc build)
  5. When generated - Timestamp (helps detect staleness)

For agent-edited artifacts (investigations, decisions, issues), this means including metadata (Status, Phase, Next Step) that makes the current state and resumption path explicit.

What this rejects:

Key distinction: For humans, discovering modification instructions is convenient (can grep, ask teammates, check history). For agents, it’s necessary (no persistent memory means no discovery path).

Why this matters: When consumers have no persistent memory (AI agents), artifacts must carry their own operating manuals. This isn’t documentation; it’s load-bearing infrastructure. Without self-describing headers, agents will edit generated files directly, breaking the build system and creating conflicts.

Progressive Disclosure

TLDR first. Key sections next. Full details available.

Why: Context windows are finite. Attention is limited. Front-load the signal.

Pattern: Summary → Key Findings → Details → Appendix

Surfacing Over Browsing

Bring relevant state to the agent. Don’t require navigation.

Why: Agents lack persistent memory and spatial intuition. Every file read costs context. Browsing that’s cheap for humans is expensive for agents.

Pattern: Commands answer “what’s relevant now?” not “here’s all the data.”

Examples:

The test: Does this tool/command require the agent to navigate, or does it surface what matters?

Evidence Hierarchy

Code is truth. Artifacts are hypotheses.

Source TypeExamplesTreatment
PrimaryActual code, test output, observed behaviorThis IS the evidence
SecondaryWorkspaces, investigations, decisionsClaims to verify

Why: LLMs can hallucinate or trust stale artifacts. Always verify claims against primary sources.

The test: Did the agent grep/search before claiming something exists or doesn’t exist?

Gate Over Remind

Enforce knowledge capture through gates (cannot proceed without), not reminders (easily ignored).

Why: Reminders fail under cognitive load. When deep in a complex problem, “remember to update kn” gets crowded out. Gates make capture unavoidable.

The pattern:

Examples:

The test: Is this a reminder that can be ignored, or a gate that blocks progress?


System Design Principles

Architectural choices for this orchestration system. These are design decisions, not universal constraints.

Local-First

Files over databases. Git over external services. Plain text over proprietary formats.

Why: Files are versionable, inspectable, diffable, collaborative. They work offline. They survive service shutdowns. They’re debuggable with Unix tools.

Compose Over Monolith

Small, focused tools that combine. Each command does one thing well.

Why: Composable tools are testable, replaceable, understandable. Monoliths accumulate complexity and become fragile.

Inspiration: Unix philosophy, Git’s porcelain/plumbing split.

Examples:

Graceful Degradation

Core functionality works without optional layers.

Why: Workspace state persists even if tmux window closes. Commands work without optional dependencies. The system shouldn’t be fragile to missing pieces.

Share Patterns Not Tools

When multiple tools need the same capability, share the schema/format, not the implementation.

The test: “Would shelling out to another tool add more complexity than reimplementing?”

What this means:

What this rejects:


Meta Principles

How we evolve the system itself.

Evolve by Distinction

Start simple. When problems recur, ask “what are we conflating?” Make the distinction explicit. Name both sides.

Examples of distinctions made:

Why this happens: Reality has more distinctions than our models. We discover distinctions empirically when conflation causes problems.

The pattern: Thesis (simple model) → Antithesis (problem reveals conflation) → Synthesis (refined model with distinction)

Reflection Before Action

When patterns surface, pause before acting. Build the process that surfaces patterns, not the solution to this instance.

The test: “Am I solving this instance, or building the capability to solve all instances?”

What this means:

What this rejects:

The discipline: Manual work is scaffolding until automated discipline exists. The temptation is the teacher. The pause is the lesson.

Pressure Over Compensation

When the system fails to surface knowledge, don’t compensate by providing it manually. Let the failure create pressure to improve the system.

The test: “Am I helping the system, or preventing it from learning?”

What this means:

What this rejects:

The pattern:

Human compensates for gap → System never learns → Human keeps compensating
Human lets system fail → Failure surfaces gap → Gap becomes issue → System improves

Why this is foundational: Session Amnesia says agents forget. This principle says don’t be the memory. Your role is to create pressure that forces the system to develop its own memory mechanisms.

Premise Before Solution

“How do we X?” presupposes X is correct. For strategic questions, validate the premise first.

The test: “Am I assuming the direction, or have I tested it?”

The sequence: SHOULD → HOW → EXECUTE

  1. SHOULD (Premise): Is this direction correct? Is the problem real?
  2. HOW (Design): Given it’s right, how do we get there?
  3. EXECUTE (Implementation): Do the work.

What this means:

Red flag words (signal premise-testing needed):


Provenance

These principles emerged from practice, not theory. Each traces to a specific incident.

PrincipleDateWhat Broke
ProvenanceDec 2025Recognized that psychosis involved elaborate externalization anchored to closed loops, not reality. The difference wasn’t externalization - it was where the chain terminated.
Session AmnesiaNov 2025Investigation on “habit formation” was reframed mid-session: “we’re designing habit formation for agents with amnesia” - revealing the real constraint
Evidence HierarchyNov 2025Audit agent claimed “feature NOT DONE” by reading stale workspace - feature was actually implemented
Gate Over RemindDec 2025”Why do I always have to remind you to update CLAUDE.md?” - reminders fail under cognitive load
Surfacing Over BrowsingNov 2025Beads and orch independently converged on same pattern (bd ready, orch inbox) - signaling fundamental design principle
Self-Describing ArtifactsDec 2025Agents edited compiled SKILL.md instead of source files, breaking skillc build
Evolve by DistinctionNov 2025Phase/Status conflation, Tracking/Knowledge conflation caused recurring confusion
Reflection Before ActionDec 2025Urge to manually extract 46 investigation recommendations - recognized kb reflect was the system-level solution
Premise Before SolutionDec 2025Epic with 5 children created from “how do we evolve skills” without validating premise; architect found premise was wrong, epic had to be paused
Pressure Over CompensationDec 2025Was about to paste context the orchestrator should have known. Realized: compensating prevents system from learning its own memory mechanisms.

The test for new principles: Can you trace it to a specific failure? If not, it’s not a principle yet.


These principles govern a working system - 37,000+ lines of Go, 100+ investigations, agents spawning agents. They’re not aspirational. They’re load-bearing.