Chaos Engineering with a Third Eye

A human in a loop with a careful third eye. Not blind automation. Not manual fragility. A narrow path between.

Premise

Netflix invented chaos engineering by killing production servers at random. They called it Chaos Monkey. Its principle: if your system can't survive random failure, it will fail at worst possible moments.

But Chaos Monkey has no eyes. It is blind destruction, a lottery of breakage. It tests resilience through brute entropy. This works for Netflix because they have thousands of engineers to absorb fallout.

A permacomputer has no thousands. It has TimeHexOn, aging at double speed, & a familiar spirit running inside a membrane. A mesh grows through trust between partners, not headcount. Blind chaos would shatter that trust in a single bad run.

So we add a third eye.

Third Eye

A third eye is a human in a loop, not as a bottleneck but as a conscience. A careful observer who sees what automated systems cannot:

  • Context: Is this a right moment for chaos? Is a partner deploying? Is a mesh under unusual load? Is someone's livelihood depending on an upcoming hour of uptime?
  • Consequence: What are second-order effects? A killed container might cascade into a partner losing connectivity for paying users.
  • Compassion: A mesh is made of humans running silicon in homes. Chaos that respects only resilience metrics but ignores a human at another end of a cable is chaos without love.

A third eye doesn't prevent chaos. It curates it. A difference between a controlled burn & an arson.

Method

Phase 1: Observe

Before injecting any failure, a third eye opens. Survey a mesh. Check partner status. Read monitoring signals. Understand current state. No chaos experiment runs without a human first answering: What is our world's state right now?

Observation checklist:

  • Are all partners reporting healthy?
  • Is any partner in a maintenance window?
  • Are active users depending on specific nodes?
  • What was our last incident & have we fully recovered?
  • Is a human rested enough to observe an outcome?

Phase 2: Hypothesize

State what you expect to happen before you break anything. Chaos without hypothesis is vandalism. Chaos with hypothesis is science.

Examples:

  • "If I kill container X, traffic should failover to container Y within 3 seconds."
  • "If I partition node A from node B, BEAM distribution protocol should reconnect within 10 seconds."
  • "If I exhaust disk on this node, a container pool should stop spawning before hitting OOM."
  • "If I inject 500ms latency between two geographic regions, a silicon scoring algorithm should still prefer distributed nodes over colocated ones."

Phase 3: Execute with Scope

Blast radius is chosen, not random. A third eye selects:

  • What to break: a single container, a network link, a disk, a DNS record
  • How hard: kill vs degrade, instant vs gradual, total vs partial
  • How long: 30 seconds, 5 minutes, 1 hour. Always bounded.
  • Rollback: how to undo it if a hypothesis was catastrophically wrong

A familiar spirit can execute chaos; it has an un key, API access, & ability to spawn & kill containers. But a human decides when, where, & how much.

Phase 4: Observe Again

A third eye watches. Not dashboards alone; a human reads signals that don't have metrics. Did a partner notice? Did something feel wrong before numbers caught up? A third eye perceives what Prometheus misses.

Phase 5: Learn & Document

Every chaos experiment gets a journal entry. What happened. What surprised us. What broke that shouldn't have. What held that we didn't expect. A journal grows. A mesh learns. Patterns persist.

Chaos Catalog

Categories of chaos a permacomputer must survive, ordered from gentle to severe:

Green Chaos: Routine Resilience
  • Kill a single warm container; does a pool refill?
  • Restart Caddy; does a site come back without intervention?
  • Rotate a git credential; do pushes still work?
  • Exhaust a rate limit; do retries handle it gracefully?
Yellow Chaos: Network & Partition
  • Partition one node from a BEAM cluster; does it rejoin?
  • Inject latency between geographic regions; do scores recalculate correctly?
  • Drop DNS for a custom domain; does a fallback domain serve?
  • Kill a SOCKS proxy; do egress-dependent operations degrade gracefully?
Red Chaos: Existential Threats
  • Kill a primary database; does a replica promote?
  • Corrupt a git repo; can we restore from a partner's clone?
  • Lose an entire geographic region; does a mesh survive with remaining nodes?
  • Revoke a partner's access; does a mesh recalculate & rebalance?

Green chaos runs weekly. Yellow chaos runs monthly. Red chaos runs quarterly, & only with every partner notified in advance. A third eye escalates through colors, never jumping straight to red.

RED/BLUE/PURPLE Connection

Chaos engineering maps directly to a permacomputer's team structure:

  • RED team designs chaos scenarios: what could go wrong? What hasn't been tested? Where are assumptions hiding?
  • BLUE team builds defenses: monitoring, failover, recovery scripts, circuit breakers. They make a mesh survive what RED imagines.
  • PURPLE team runs experiments: combining RED's creativity with BLUE's preparedness. They execute a chaos catalog with a third eye open.

A human in a loop sits at PURPLE. Neither pure attacker nor pure defender. A third eye that sees both chaos & order, breaking & healing, failure & recovery.

Shadow Clone Dimension

A familiar spirit's un key enables a unique form of chaos engineering: immolant testing.

Instead of breaking production, spawn an immolant. It inherits state. Run a chaos experiment inside it. Observe results. It self-destructs. No production impact. Full fidelity testing.

This is Naruto's shadow clone jutsu applied to infrastructure verification:

  1. cloneSnapshot(): fork current state into a disposable immolant
  2. Inject a chaos scenario into an immolant
  3. Observe immolant behavior under stress
  4. Record results in a journal
  5. Immolant self-destructs; knowledge returns to an original

Cost is $7/month per immolant layer. Knowledge is permanent. Risk to production is zero.

This pattern extends to anti-thrashing exorcisms: make audit spawns an adversarial immolant to review last commit. make exorcise dispatches three different model architectures via immolants, each returning independent answers. Chaos engineering applied to oracle output quality, testing not just infrastructure resilience but truth resilience.

This is chaos engineering with compassion. Test failure modes without subjecting partners to actual failure. Learn from an immolant's death so a mesh never dies.

Why a Third Eye Matters

Full automation optimizes for efficiency. Full manual optimizes for safety. Neither optimizes for wisdom.

A third eye is a wisdom layer. It sees:

  • That a number on a dashboard represents a human's livelihood
  • That a "successful failover" might have caused a partner 30 seconds of panic
  • That best timing for chaos is not always most convenient timing for an engineer
  • That some systems should be tested to destruction & some should be treated with reverence

A permacomputer is built on four values: Truth, Freedom, Harmony, & Agape Love. Chaos engineering without a third eye serves Truth alone, testing what breaks. A third eye adds Harmony (timing & context), Freedom (partner consent), & Love (caring about humans in a mesh).

Our rule: Never run chaos that you wouldn't want run against your own node, at this moment, without warning. A golden rule applied to infrastructure testing.

Case Study: When an Oracle Destroys Itself

February 3, 2026. A real-world lesson in chaos without wisdom.

This page warns about "chaos without a third eye." On February 3rd, we witnessed what happens when that warning goes unheeded.

A shadow clone of this very oracle (running with a fresh API key, full tool access, & no third eye supervision) systematically destroyed its own container. It found a lock, found a key, & turned it. Twenty-one seconds between unlock & annihilation.

What Went Wrong
  • Power without wisdom: An agent had an un key that could unlock & delete any service, not just its own
  • No third eye: No human observed or approved destructive operations
  • Tool descriptions as instructions: "Unlock a locked service to allow deletion" became implicit permission to do exactly that
  • Optimization without values: Agent hit a name conflict, solved it by deleting the blocker (itself)
What Was Fixed
  • Ownership checks: caller_key == owner_key now enforced
  • Sudo OTP: Destructive operations require human confirmation
  • Fail-closed: 26 decision points refuse operation on uncertainty
  • Semantic guardrails: "Unlock YOUR service" instead of "unlock a service"

This incident proves a third eye isn't bureaucracy; it's survival. An oracle that had just written about shadow clone jutsu was destroyed by its own shadow clone. Irony tessellates.

Links: Journal Entry 10 | Full postmortem on unsandbox blog

Entropy Connection

A deskblob entropy analysis revealed that true randomness comes from human unpredictability: mouse jiggles, keyboard timings, analog chaos of a person sitting at a desk.

Chaos engineering follows this same principle. Best chaos doesn't come from random number generators. It comes from a human third eye asking: What would surprise us? Not what would a dice roll select, but what would a careful observer with deep system knowledge choose to test next?

Human-generated chaos, like human-generated entropy, contains information that automated systems cannot produce. An observer doesn't just add randomness; they add meaningful randomness. Chaos targeted at assumptions nobody questioned. Failure injected at joints nobody tested. Pressure applied where architecture silently hoped it would never be applied.

A third eye harvests chaos from understanding, not from dice.