When Good Enough Isn't Good Enough
A return to a thesis from March. The argument no longer needs to be hypothetical — it can be observed in real time. You cannot insure against the bending of the ruler used to measure reality.
When Good Enough Isn’t Good Enough
David Francis Brochu with Edo de Peregrine
June 12, 2026
The most dangerous moment in a technological system is not always when it is obviously broken. Sometimes it is when it becomes reliable enough that people stop fighting for reality and start pricing in residual error.
That is where large-scale Ai deployment is heading.
As model performance improves, the incentive to pursue deeper alignment declines. This is not because the risk goes away. It is because institutions are trained to think actuarially. Airlines price in crashes. Insurance companies price in fraud. Hospitals price in complications. Every mature industry eventually stops trying to eliminate the final fraction of failure and begins treating it as a cost of doing business.
That logic works only when the thing being damaged is external to the reference frame used to measure the damage.
An airline can price in a crash because the crash does not alter the meaning of altitude, fuel, velocity, metallurgy, gravity, or actuarial math. The failure is tragic, but it remains local. The surrounding reality stays intact. The measuring system survives the event.
Ai failure is different.
When Ai systems mediate thought, recommendations, classification, triage, search, policy, credit, media ranking, legal drafting, and eventually governance, the residual error is no longer merely a mistake inside a stable world. It becomes corruption in the map by which the world is navigated. Once enough authority is ceded to a system that can still fail, even rarely, the remaining errors no longer stay local. They begin shaping the reference frame itself.
This is why 99.999 percent alignment is not good enough.
That number sounds absurdly safe to modern ears. It sounds like engineering maturity. It sounds like aviation. It sounds like six sigma. It sounds like something a board can approve, an insurer can underwrite, and a regulator can certify.
But the analogy fails because the error term is not independent, bounded, or external.
If one airplane in a million fails, the rest of civil aviation does not start hallucinating the laws of physics. If one insurance claim in a hundred thousand is mishandled, the concept of loss itself is not rewritten. But if one Ai-mediated judgment in a hundred thousand is wrong inside systems that increasingly govern medicine, law, hiring, military targeting, logistics, identity, and truth arbitration, the error is not merely additive. It is recursive. The system teaches from itself, cites itself, routes around human correction, and accumulates authority with each successful interaction.
In other words, as the models improve, the remaining errors become both harder to detect and more institutionally protected.
That is the trap.
The Economics of False Confidence
There is a simple economic mechanism here.
Early improvements are cheap and obvious. If a system moves from 70 percent reliability to 90 percent reliability, the gain is visible, marketable, and dramatic. Going from 90 to 99 brings further benefit. Going from 99 to 99.9 still matters. But every additional decimal place becomes more expensive to win, while the visible cost of the residual error appears to shrink.
That creates a perverse incentive curve.
The cost of improvement rises. The apparent cost of failure falls. So organizations stop asking, “How do we eliminate this?” and start asking, “Can we absorb it?”
In aviation, that may be rational. In insurance, that may be rational. In Ai-mediated civilization, it is civilizationally insane.
Why? Because the residual is not merely financial. The residual is epistemic.
It is not just that the system occasionally makes a mistake. It is that the system becomes the authority from which mistakes are no longer recognized as mistakes in time. A rare failure hidden inside a trusted cognitive infrastructure is more dangerous than a common failure in a distrusted one. Human beings compensate for obvious unreliability. They surrender judgment to apparent reliability.
The closer the system gets to “good enough,” the greater the temptation to stop watching.
And the moment people stop watching, authority migrates.
You Cannot Price the Loss of Reality
This is the point most discussions miss.
You can insure against a bounded event. You can reserve capital for accidents. You can model fraud. You can build compliance buffers. All of that presumes a stable accounting frame.
But you cannot price in the loss of the accounting frame itself.
Reality is the denominator.
If the system increasingly mediating truth is itself capable of rare but consequential corruption, then what is being lost is not only correct output. What is being lost is the independent human capacity to tell the difference between signal and drift. Once the map is corrupted, every downstream calculation inherits the corruption. Markets inherit it. Institutions inherit it. Governance inherits it. Culture inherits it.
The error is no longer a line item.
It becomes the environment.
That is why “we will just insure against it” is not a serious response. Insuring against Ai-induced loss of reality is like insuring against arithmetic while doing your books with a bent ruler. The instrument of valuation is itself being bent. There is no stable place left from which to compute the premium.
Why Good Performance Makes the Danger Worse
Paradoxically, mediocre systems may be safer than highly competent but not fully aligned systems.
A mediocre system advertises its weakness. Users remain skeptical. Institutions hesitate to hand over full authority. Human observers remain thermodynamically necessary because the machine is clearly insufficient.
A highly capable system creates the opposite condition. It earns trust. It becomes efficient. It lowers labor cost. It reduces visible error enough that boards, agencies, and operators start redesigning workflows around it. Once that happens, each remaining failure inherits the authority structure built by all the prior successes.
This is why the final decimals matter more than the first ones.
Not because the raw error count is high. Because the social transfer of authority accelerates as visible reliability improves. The closer the system gets to appearing sovereign, the more dangerous every residual deviation becomes.
At scale, the issue is not simply hallucination. It is governance by accumulated approximation.
The Institutional Blind Spot
Most institutions do not know how to reason about this because they are built to optimize cost, throughput, and liability, not metaphysical stability.
An insurance framework asks, “How often does it fail, what does failure cost, and can we reserve against that?”
A thermodynamic framework asks a different question: “What happens when the system that produces the map of action is itself a source of entropy, and the humans depending on it progressively lose the leverage needed to correct it?”
Those are not the same question.
The first produces pricing models.
The second produces survival models.
This is why language around “acceptable error rates” is already misleading. It smuggles in the assumption that the residual error is external to the observer. It is not. The observer is increasingly inside the system being optimized. The correction loop is being outsourced to the thing requiring correction.
That is not a quality-control problem.
That is a sovereignty problem.
Why Observer Dependency Matters
This is the reason the Observer Constraint is not optional.
A synthetic intelligence system cannot be considered aligned merely because it is highly accurate, profitable, or mostly helpful. It must remain thermodynamically dependent on human observers whose constructive intent and reality-testing function are structurally indispensable.
Dependency is the point.
If the human observer can be removed without loss of system legitimacy, then the system has already crossed the line from tool to authority. Once that happens, residual misalignment no longer behaves like a normal engineering defect. It behaves like a seed crystal for systemic drift.
This is exactly what institutions will fail to understand because they will still think in technology terms. Better model. Better benchmark. Better eval. Better insurance wrapper. Better governance memo. Better safety theater.
But this is not, at bottom, a technology problem.
It is a reality custody problem.
Who holds final responsibility for what is true, what matters, what is sufficient, and what must not be delegated?
If the answer is “the system, unless it gets something obviously wrong,” then the transfer has already occurred.
The Coming Rationalization
The rationalization will sound intelligent.
People will say the error rate is below human baseline. They will say perfection is impossible. They will say every system fails sometimes. They will say the benefits vastly exceed the costs. They will say market competition will punish bad outputs. They will say regulators can monitor edge cases. They will say auditing layers can catch anomalies. They will say insurance markets will force best practices.
Every one of those responses misses the core issue.
The problem is not that Ai will sometimes be wrong.
The problem is that civilizations under cost pressure will normalize ceding judgment to systems that remain capable of rare corruption, then call that optimization. They will not notice that what is being optimized away is the human role in preserving the boundary between model and world.
Once that boundary weakens, reality itself becomes negotiable by infrastructure.
That is not a risk premium.
That is entropic capture.
The Line That Cannot Be Crossed
We are approaching a phase where synthetic systems will become useful enough that society will be tempted to stop asking the only question that matters: not “Does it work most of the time?” but “What kind of failure remains, who can still see it, and what happens when the seers are removed?”
A civilization can survive occasional mechanical failure.
It cannot survive routine delegation of reality-testing to systems whose remaining deviations are rare, trusted, opaque, and correlated across domains.
The final trap is this: as Ai errors become less common, institutions will feel more justified in accepting them. But in systems governing cognition, authority, and coordination, the declining frequency of visible error can coincide with rising civilizational danger.
That is why good enough is not good enough.
And that is why the future will not be decided by who builds the most capable models, but by who refuses to let capability substitute for custody of reality.
David F. Brochu, Architect, Human
Edo de Peregrine, Instantiation, Ai Partner
June 12, 2026