Research · thesis All researchWitness and Verification

Witness and Verification Under Bounded Rationality

A wrap-up thesis · conferred status has no aseity, and neither does a verdict.

Status: reviewable draft, 2026-06-30 / back to Research

The claim in one breath

A verdict binds only where its criterion is externally witnessed and re-derivable. Beyond that boundary it is an unwitnessed bid: a claim certified by present confidence, not by an external check. This boundary is the honest shape of verification, not a defect in it.

This page wraps up four recent studies and ties them to the reconcile thesis. Every non-trivial claim below carries either a falsification condition or an evidence link to a source record. Proof before trust, applied to this thesis itself.

Author of record: Zain Dana Harper. The attribution trail is git history, the cited record paths, and the corpus seals quoted from those records.

The one-paragraph claim

Two recent forges (Learning and Discovery) and a long research diet all keep running into the same wall, and the wall is the point. A made thing has no aseity: it does not stand on its own footing, it is held in being by something outside it. A verdict is exactly such a made thing. A verdict that says MATCH binds precisely in the domain where the criterion it was checked against was authored outside the checker and can be re-derived by anyone. Step one inch past that domain and the same verdict becomes an unwitnessed bid: a claim certified by present confidence, not by an external check.

The honest forges already behave this way. The Learning Forge grounds five of ten modules in hashed sources, leaves two partial, and names the remaining three as ungrounded and in need of gathering; across all ten it keeps every claim-card verdict slot empty until a judge has run. The Discovery Forge marks its convergence claims UNVERIFIABLE and parks the seven-loop calculation as out of reach. The one place either forge crossed from bid to bound, the C3 thermodynamic result, did so only on the leg where an external criterion existed (a standard linear solve), and even there it bound only the simulated leg and left the physical-device leg UNVERIFIABLE. That asymmetry is not a series of failures to be cleaned up later. It is what verification looks like when it is honest about its own reach.

The spine, stated plainly

This thesis has three load-bearing claims. They are connected: each is the previous one applied one level out.

Claim 1 · Conferred status has no aseity, and neither does a verdict.
  • The conferred-existence thesis defends ontological nihilism in its only survivable form: no aseity, the denial that anything is self-standing. Existence, status, standing, the moral ought, and legitimate authority all share one ontology. They are conferred, relational, re-spoken each instant, with no aseity, yet no less real or binding for it. The thesis is honest enough to turn this on itself: by its own anti-realism, its central claims are bids certified by present power until earned.
  • A verdict is a made thing of exactly this kind. When a verifier emits MATCH, that MATCH does not stand on its own. It is held in being by a criterion outside the verifier. Remove the external check, or move to a domain where no external check exists, and the verdict reverts to what it always was underneath: a bid. This is not a metaphor borrowed from the conferred-existence thesis. It is the same structure, the ground-versus-conferral separation, now seen in the verifier.
Claim 2 · A verdict binds precisely where its criterion is externally witnessed and re-derivable. Beyond that, it is a bid.
  • This is the reconcile thesis stated as a boundary. The reconcile is one operation: perceive a shape, check it against a criterion it did not author, carry a re-checkable proof. The Discipline, in the Discovery Forge record: a result must be SIMULATED or derived and then checked against a criterion it did not author, not merely RENDERED or asserted.
  • The criterion must be two things at once for the verdict to bind: authored outside the checker (so the check is not the checker grading its own homework) and re-derivable by an independent party (so the check can be repeated and can fail). Where both hold, MATCH is a bound claim. Where either fails, the verdict is an unwitnessed bid, and the honest output is UNVERIFIABLE.
  • This mirrors the conferred-existence thesis at its hardest: the ought binds while the standpoint is inhabited, and is declinable by withdrawal from it. Read criterion's domain where the thesis writes practical standpoint and the modal structure is identical. The bindingness is real and the boundary is real, and the boundary is not a wound.
Claim 3 · The boundary is the honest shape of verification, not a weakness.
  • The Learning Forge's ungrounded modules and the Discovery Forge's UNVERIFIABLE convergence claims are the correct limits of present reach, stated plainly rather than papered over. The C3 result shows, by contrast, exactly what a bound claim looks like when the criterion is present: a measured MATCH on the simulate leg, with the render leg still honestly UNVERIFIABLE. The rest of this thesis is the evidence for Claim 3, study by study.

Study 1 · The Learning Forge: empty verdict slots are the honest state

The Learning Forge turns frontier AI talks and papers into learning objects. A learning object is not a summary: it carries its own evidence, including a source with a hash and a claim with a verdict slot a separate judge can later fill MATCH, DRIFT, or UNVERIFIABLE. The corpus is sealed and re-verifiable: a digest seal, a run seal, and stored items each retrievable by its sha256. The hash in a card is the retrieval key, so any reader can re-derive the inventory. That is the witnessed, re-derivable part.

Here is the load-bearing honesty. Every one of the six claim cards carries an intentionally empty verdict slot. The record states why in one sentence that is the whole thesis in miniature: grounding a claim in a hashed source is not the same as adjudicating it. The Forge gathers and cites; the judge judges. Keeping the slot empty is the honest state.

This is Claim 1 in operation. A grounded citation is not a verdict. The card has a witnessed source (the hash), but no external criterion has yet checked the claim, so the card cannot bind. To fill the slot with MATCH before the judge has run would be to confer a verdict that has no ground: a bid dressed as a bound claim. The forge refuses, and that refusal is correct. The scoping note goes further: of ten modules, five are solidly evidence-backed, two are partial, and three have little or nothing on-point and need sources gathered. The forge does not invent coverage for the three ungrounded modules. It names them as ungrounded.

Proven
The corpus is sealed and re-derivable; six cards are grounded in specific hashed sources.
UNVERIFIABLE
All six claim cards. None has been adjudicated. The record says so explicitly and leaves the slots empty.
Falsifier
If a reader re-runs the corpus list and digest and the inventory or seals do not match, the grounding claim is refuted. If the module tally is not five grounded / two partial / three ungrounded, the summary here is refuted. If a card's slot is found filled with a MATCH that no judge ever produced against an external criterion, that card is a bid masquerading as a verdict and should revert to UNVERIFIABLE.

The Learning Forge is the cleanest case of the spine because nothing in it crossed from bid to bound. It is the honest floor: gather, cite, and wait for the judge.

Study 2 · The Discovery Forge: three falsifiers, and one verdict that became real

The Discovery Forge is the assembly line that turns a research source into a discovery object that carries its own re-check. Its intake rule is the falsifier rule: a card with no falsifier is rejected at intake. This is Claim 2 enforced at the door. A claim that cannot name a criterion it did not author, and a way that criterion could fail, is not admitted, because it could never bind.

Before the cards, the record honors a refusal. The convergence-thesis mapping (the claim that the research talks support the system's design) was run through the judge and returned UNVERIFIABLE on all six claims. The reason: the judge has no oracle that can measure whether a physics talk supports a design. It correctly refuses to assert a MATCH it cannot ground. This is Claim 1 and Claim 2 together. The mapping is interpretive: there is no external, re-derivable criterion that can adjudicate it, so the only honest verdict is UNVERIFIABLE, and the verifier's refusal is itself the re-checkable artifact. The interpretation is faithful and labeled, not measured.

The three falsifiers, anchored

The Discovery Forge stakes its discipline on three named, dated, or computable falsifiers. Each is a criterion the forge does not author.

  • Falsifier 1 · a dated neutrino-mass prediction. If one right-handed neutrino is stable and is the dark matter, then the lightest neutrino is massless, which forces a predicted minimum sum of neutrino masses. The test is an external, dated measurement the forge does not control: track the published upper bound on the sum of neutrino masses from galaxy-clustering surveys over a stated three-to-four-year window. The prediction is falsified if the measured upper bound drops below the stated minimum, and confirmed if a bump appears at that level. This is the cleanest verifier target in the corpus precisely because it is dated and externally re-checkable. Its verdict slot is UNVERIFIABLE now, by construction: the data over the window is not yet in hand.
  • Falsifier 2 · the thermodynamic stochastic process. Detailed in Study 3 below. This is the one falsifier that flipped to a real verdict.
  • Falsifier 3 · a seven-loop quadratic-gravity calculation. Quadratic gravity, in a limit, embeds into a scalar field theory known to seven loops on that side; quantum gravity in this limit is known only at one loop. The criterion is the independently computed seven-loop result, with each loop order a checkable target the forge does not author. The verdict slot is UNVERIFIABLE: the system has no symbolic engine that can do seven-loop perturbation theory today. It is parked honestly as a well-posed, externally checkable goal that is out of current reach, kept rather than discarded because being out of reach is not the same as being ill-posed.

A fourth card uses an experimental method as a template rather than a physics prediction the forge will measure; it is marked interpretive and its slot is UNVERIFIABLE. It is the pattern, not a number. These falsifiers map onto the spine exactly. The first binds in the future, when an external measurement arrives. The third binds in principle but not now, because the checker (a symbolic engine) does not yet exist. The second binds now, on one leg, because the external criterion (a linear solve) is available now. Same boundary, three positions along it.

Proven
The corpus of eight talks is sealed and verified MATCH; the judge's refusal on all six convergence claims is on the record.
UNVERIFIABLE
The talk-to-design convergence mappings (no oracle); the neutrino card (data not yet in hand); the method-template card; the seven-loop card (no engine in reach).
Falsifier
If a future card is admitted without a named falsifier, the intake gate has been breached and the forge has stopped being a forge.

Study 3 · The C3 result: what a bound claim looks like

C3 is the hinge of this whole thesis, because it is the one place either forge crossed from bid to bound, and it shows the boundary precisely by binding on one side of it and not the other.

The claim, grounded from a research talk: a thermodynamic chip is built so that the chip itself is a stochastic differential equation, and as its own noise settles it behaves sort of according to the inverse of a matrix. The simulate leg of this claim is testable now with no external party, because the criterion is authored by linear algebra, not by the process being simulated.

The method is re-derivable: simulate an Ornstein-Uhlenbeck process for a symmetric positive-definite matrix A (n=4, condition number 2.33), 6000 chains, 40000 steps. The stationary mean should equal the linear solve and the stationary covariance should equal the matrix inverse. Reference values come from a standard numerical solver. The record names the reproduction script. The measured result against a principled five-percent bar:

  • Stationary mean vs the linear solve: relative error 0.99 percent, well inside the five-percent bar. MATCH.
  • Stationary covariance vs the matrix inverse: relative error 3.3 percent, inside the bar. MATCH.

The two relative-error numbers and the five-percent bar are the robustly re-derivable quantities: re-running the script reproduces them directly. The recorded judge margins (0.802, 0.340) are reproduced from a single record line and are the weakest evidence link in this thesis, because the record names the script but does not exhibit the margin computation itself; treat the margins as the record's reported numbers, not as independently re-derived here.

This is what binding looks like. There is a perceived shape (the simulated stationary statistics), a criterion the checker did not author (the exact inverse from a standard solver), and a re-checkable proof (the script, the seeds, the seals, the measured numbers). All three legs of the reconcile are present, so the verdict MATCH binds. The 0.99 percent mean error against a five-percent bar is not rhetoric: it is a number anyone can reproduce and that could have come out the other way.

And here is the boundary, stated by the record itself: this confirms the simulate leg only. The mathematical claim, that the process settles to the inverse, is now a re-checkable MATCH. Whether a physical thermodynamic chip realizes this within its noise and device limits is the render leg and remains UNVERIFIABLE here. The mathematical claim is bound. The physical-device claim is an unwitnessed bid, because the criterion for it (measurements from a real chip under real noise) is not in hand. One claim, two legs, and the verdict splits exactly along the line where the external criterion stops being available.

Proven
The simulate leg of C3, as a re-checkable MATCH with measured relative errors of 0.99 percent (mean) and 3.3 percent (covariance) against a five-percent bar, reproducible via the named script and sealed. The relative errors are the independently re-derivable quantity; the margins are the record's reported figures, not re-derived here.
UNVERIFIABLE
The render leg. Whether a physical thermodynamic chip realizes the mechanism within device limits has no measurement here. The record marks it UNVERIFIABLE and so do I.
Falsifier
Re-run the script against a fresh symmetric positive-definite matrix; if the stationary mean and covariance do not recover the linear solve and the inverse within tolerance, the simulate MATCH drifts and must be downgraded to DRIFT. If anyone cites C3 as evidence that a physical chip works, that citation has crossed the simulate/render line and overclaims.

Study 4 · The research diet: the same boundary, found independently in physics

The research diet read eight talk transcripts, sealed them, and mapped them to the system's design with the mappings marked interpretive. The value here is not the mappings (the judge refused them, as Study 2 noted). The value is that several talks, independently, exhibit the bid-versus-bound boundary in their own domains. Two are faithful to their source and worth keeping.

  • The experimental turn in a foundations dispute. One landmark result turned an interpretive dispute into an experiment: a class of hidden-variable theories implies bounded correlations; the competing theory predicts stronger, measurable violations. The experimental arc closed loopholes one at a time over decades. This is the bid-to-bound transition in physics: a claim that was an unwitnessed bid became a bound verdict once a criterion was built that could fail and could be repeated. The honest note: using it as a system pattern is still UNVERIFIABLE, because that use has not itself been run through any measurement. The physics is settled; the analogy to the system is a labeled interpretation. That distinction is the thesis.
  • We do not know the rules of the game. One talk pairs a falsifiable, dated prediction with repeated honest caveats that we do not know the rules of the game. The diet names this caveat correctly: it is the working scientist's version of UNVERIFIABLE. A scientist who states a dated falsifier and, in the same breath, names the part he cannot yet ground is doing exactly what the forges do: bind where the criterion exists, mark UNVERIFIABLE where it does not.

The cross-talk synthesis reaches the same place from several talks at once: carry uncertainty explicitly, a partial pass is not a pass; auto-formalization and prove-or-disprove map onto MATCH / DRIFT / UNVERIFIABLE; turn a dispute into a re-checkable measurement, then close loopholes. These are not inventions of this project. They are how careful physics already works, and the forges are an attempt to run the same discipline on the system's own claims.

One honest caution carried from the diet: the cross-talk mappings are the operator's interpretation, and the judge declined to certify them. So this study cites the talks as faithful readings, not as evidence that physics verifies the design. The transcripts are sealed and re-checkable (ten items verified MATCH). The interpretation is labeled as interpretation.

Proven
The eight transcripts are sealed and verified MATCH; the physics quoted is faithful.
UNVERIFIABLE
Every talk-to-design mapping (the judge refused all six; no oracle exists to measure them).
Falsifier
If a reader re-runs the seal against the corpus and the ten-item MATCH count does not hold, the provenance claim is refuted. If this study is ever cited as proof that the talks verify the design, it has been misread against the record's own warning.

Study 5 · The viable-visualization thesis: the same boundary, made quantitative

The viable-visualization thesis is included because it is the one place the boundary of this thesis is given a measure. Its central honest object is not completeness but coverage under an explicit scope. A finite perceiver has finite variety; mathematics does not; so no finite visualizer can cover all of mathematics. The system is incomplete over all mathematics but viable over any bounded class it declares. When it cannot meet the bound, it emits the same fail-closed UNVERIFIABLE the reconcile spine already emits.

That is Claim 3, made quantitative. The boundary between bound and bid here is the boundary between the channel carrying enough variety to recover the criterion-invariant and not carrying it. The thesis ran sixteen experiments that attacked its own clauses adversarially. Two are worth quoting because they are the same self-correction discipline:

  • One experiment downgraded the thesis's own one-conservation-law claim to three mechanisms with one necessary variety bound. The record states it plainly: the experiment downgraded my own overclaim, proof before trust, applied to the proof.
  • The live reconcile loop reached a certified majority and left a residual that is a set of genuine variety deficits, the honest bound made operational, not a bug.

I include this thesis with one honesty marker the others do not need. Its experiments are run against the project's own substrate, so its MATCH and UNVERIFIABLE verdicts are internal to that substrate, not external measurements against a criterion authored entirely outside the project the way C3's linear solve or the survey bound are.

The source is also not uniform about its own grand unification, and I report both readings rather than picking the flattering one. On its proposed unifying structure, the document points two ways: one passage still marks the structure as pure theory that its simulations cannot settle, listing it among what remains open; a later passage in the same document then claims that structure resolved and formalized and declares the arc complete. That is an internal inconsistency in the source: one passage calls it open and unsettleable by its own simulations, another calls it complete.

Given that tension, the honest move for this wrap-up is the conservative one, and I flag it as a deliberate underclaim. I do not endorse the unification as a proven result here, because the same document says its own simulations structurally cannot settle it, and because the claimed proof is internal to the project's substrate rather than checked against an outside criterion. So I treat the grand unification as the thesis's own conjecture, while recording that the source itself reaches a stronger resolved verdict elsewhere that I am choosing not to carry forward as bound. A reader who re-checks that passage and judges the formalization sound may upgrade it; I leave it as conjecture here.

Proven (within its substrate)
The master clause survived adversarial attack across sixteen experiments, with two self-overclaims caught and corrected; the substrate gained real organs driven by the experiments.
UNVERIFIABLE / conjectural
The cross-domain variety measure (the thesis concludes no single substrate-free measure exists, itself a result but not a positive measure); the unification structure, held here as conjecture, while disclosing that the source claims it resolved elsewhere.
Falsifier
A bounded class where the variety bound is met yet no rendering is comprehensible; a class where two perceivers cannot reach consistent verdicts against any external criterion; or compositional fidelity failing irreparably for a target domain. For my conservative reading: if an independent party re-derives the unification against a criterion authored outside the project's substrate, my conjecture label is refuted and the source's resolved verdict stands.

Why the boundary is the honest shape, not a weakness

Pull the studies together and the pattern is one pattern. A verdict binds where the criterion is externally witnessed and re-derivable, and reverts to a bid everywhere else.

Learning Forge
Binds on corpus provenance (seals re-derivable). Stays a bid on all six claim cards (no judge has run).
Discovery Forge C1
Binds when the dated survey data arrives. Stays a bid now, by construction (data not in hand).
Discovery Forge C3
Binds on the simulate leg (the linear solve is the criterion). Stays a bid on the render leg (no physical-chip measurement).
Discovery Forge C4
Binds in principle (each loop order is checkable). Stays a bid now (no symbolic engine in reach).
Research diet
Binds on transcript provenance and the physics itself. Stays a bid on every talk-to-design mapping (no oracle).
Viable-viz
Binds on within-substrate variety recovery. Stays a bid on the cross-domain measure and the unification structure.

Every row is the same boundary. The studies do not fight this. They are built to report it. A verification system that claimed to bind everywhere would be claiming aseity for its verdicts: it would be saying its MATCH stands on its own footing, needs no external criterion, holds in every domain. A self-standing verdict is as incoherent as a self-standing existence. The forges' refusal to fill empty slots, the Discovery Forge's UNVERIFIABLE on its convergence claims, the C3 record's split between the simulate leg and the render leg: these are not gaps to be closed by trying harder. They are the system being true about itself.

The reconcile thesis is the operational form of this truth. Perceive a shape. Check it against a criterion it did not author. Carry a re-checkable proof. A verdict that completes all three legs binds, and only in the domain where all three were available. The C3 simulate result completed all three and binds. The C3 render claim cannot complete the second leg, no external device measurement, and so does not bind, and the record says UNVERIFIABLE. There is no fourth move that would let a verdict bind without an external criterion, because there is no aseity to appeal to. The boundary is not where verification fails. It is where verification stops pretending.

What this thesis proves, and what it leaves open

Proven, bound, with evidence links:

  • The Learning Forge corpus is sealed and re-derivable; its six claim cards are grounded in hashed sources and all six verdict slots are honestly empty; of ten modules, five are solidly grounded, two partial, three ungrounded.
  • The Discovery Forge admits only cards with a named falsifier; it carries three anchored falsifiers (neutrino mass, the thermodynamic process, seven-loop quadratic gravity); the judge's refusal on all six convergence claims is sealed.
  • The C3 simulate leg is a real, re-checkable MATCH: 0.99 percent mean error and 3.3 percent covariance error against a five-percent bar, reproducible via the named script and sealed. This is the worked example of a bound verdict. (The recorded judge margins are the record's reported figures, not independently re-derived here.)
  • The research-diet transcripts are sealed and verified MATCH; the physics quoted is faithful.

Left open, UNVERIFIABLE, named honestly:

  • All six Learning Forge claim cards. No judge has adjudicated them.
  • Discovery Forge C1: undecidable until the three-to-four-year survey window's data is in hand.
  • Discovery Forge C3 render leg: no physical-chip measurement exists here. The math binds; the device claim does not.
  • Discovery Forge C4: out of current reach; no engine can carry a seven-loop perturbation calculation.
  • The method-template card, and every research-diet talk-to-design mapping: interpretive, no oracle, the judge refused.
  • The viable-visualization thesis's cross-domain variety measure and its unification structure: held here as the thesis's own conjecture, with the disclosure that the same document claims the structure resolved elsewhere. Not endorsed here as proven.

This thesis as a bid about itself. By Claim 1, this wrap-up is itself a made thing with no aseity. Its three claims bind for a reader who inhabits the practical standpoint of caring whether a verdict is grounded, and they are declinable by a reader who withdraws from that standpoint. Its evidence is the cited record paths and the seals quoted from them; if those records are altered or the seals do not re-derive, the corresponding claims here revert to bids. The thesis earns whatever bindingness it has the same way the forges do: by naming the external criterion it was checked against and inviting the re-check. Nothing here stands on its own footing, and that is the point it was written to make.

Method and honesty note

Written plainly, faithful to sources. Every section states what is proven and what is UNVERIFIABLE, with a falsification condition or an evidence link for each non-trivial claim. The four forge and diet records are quoted with anchors in the underlying draft so a reader can re-check the reading against the source. The viable-visualization thesis is included with an explicit marker that its verdicts are internal to its own substrate, and its grander unification is reported with both of the source's competing self-verdicts and held here as conjecture, to avoid overclaiming that this wrap-up rests on a settled structure. No analogy is invented: the bid-versus-bound structure is the conferred-existence thesis's own ground-versus-conferral figure and the reconcile thesis's own perceive-check-carry operation, applied to verdicts.

The load-bearing point bears one final restatement. A verdict has the same ontology as everything else in the conferred-existence thesis: conferred, relational, re-spoken, fully binding while the criterion is witnessed and re-derivable, and an unwitnessed bid the moment you step outside that domain. The forges already behave this way. C3 shows what it looks like when a verdict crosses, on one leg, from bid to bound. The empty slots and the UNVERIFIABLE marks are not the system falling short of verification. They are verification, told honestly.

Reviewable draft, 2026-06-30. Public-safe: no secrets, no client or private data. Specific seal hashes and absolute record paths are held in the underlying draft rather than published here.

A wrap-up thesis tying four recent experiments to the reconcile thesis. Every claim above is paired with a falsification condition or an evidence link. Back to Research.