Working paper draft
Hyphal Context Benchmark for Receipt Routing
A deterministic fixture for receipt-first research context.
Zain Dana Harper/ Seattle · 2026/ draft · not archive-submitted/ research index
Status
This page is website copy for a working paper draft. The benchmark result is HYPHAL_CONTEXT_FIXTURE_MATCH for one deterministic fixture only. It compares a full-context route against a gradient-envelope-and-receipt route over the frozen biology/network-intelligence corpus. Model answer quality is NOT_MEASURED. General route superiority is UNVERIFIABLE. A native BuildLang/buildc receipt is NOT_REPLAYED.
Claim ledger
- Benchmark fixture
- HYPHAL_CONTEXT_FIXTURE_MATCH. The hyphal route recovered the same required evidence classes and guardrails as the full-context route with fewer estimated prompt tokens.
- Full-context route
- MATCH. Sends all ten source bodies plus the source gate and seed note. Estimated prompt tokens: 123413.
- Hyphal route
- MATCH. Sends ten gradient envelopes and rehydrates six evidence cards by receipt. Estimated prompt tokens: 1338.
- Guardrails
- MATCH. Both routes block biological nervous-system equivalence, universal intentional common-mycorrhizal-network messaging, and overpromoting the protocol beyond the fixture.
- Tooling receipts
- MATCH / VERIFIED. Crucible assessed the bounded thesis as MATCH 3 / DRIFT 0 / UNVERIFIABLE 0. Learn reverified the prooflesson receipt as VERIFIED.
- Scope
- UNVERIFIABLE outside fixture. The result does not prove that hyphal routing wins generally or that a model answer improves.
Measured fixture
- Token savings
- 122075 estimated prompt tokens.
- Token savings ratio
- 0.9892 of the full-context estimate.
- Evidence recall delta
- 0 across the six required evidence classes.
- Guardrail delta
- 0 across the three bounded guardrails.
Required evidence
- fungal_signal
- plant_signal
- network_evidence
- overclaim_boundary
- source_availability
- architecture_seed
Benchmark relation
hyphal.required_class_recall == full.required_class_recall
hyphal.guardrail_recall == full.guardrail_recall
hyphal.estimated_prompt_tokens < full.estimated_prompt_tokens
The current receipt verifies this relation in JavaScript over one frozen corpus. The next version should replay the relation in BuildLang/buildc and add a model answer-quality phase.
Receipts
- Benchmark CLI: demo/hyphal-context-benchmark.mjs.
- Benchmark receipt: hyphal-context-benchmark-2026-07-02.json.
- Crucible run: twenty-second-wave-hyphal-context-run-2026-07-02.json.
- Crucible report: twenty-second-wave-hyphal-context-report-2026-07-02.md.
- Learn prooflesson: twenty-second-wave-hyphal-context-benchmark.prooflesson.json.
- Official copy: HYPHAL-CONTEXT-BENCHMARK-FOR-RECEIPT-ROUTING-2026-07-02.md.
Do not infer
- Do not infer that this fixture is a general context-routing theorem.
- Do not infer that estimated prompt tokens equal provider billing tokens.
- Do not infer that model answer quality has been measured here.
- Do not infer that the benchmark changes the biology source boundaries.
- Do not infer that this relation already has a native BuildLang/buildc receipt.
Local source draft: docs/research/whitepapers/HYPHAL-CONTEXT-BENCHMARK-FOR-RECEIPT-ROUTING-2026-07-02.md. Current page status: draft website copy. Updated 2026-07-02.