Working paper draft

Hyphal Context Benchmark for Receipt Routing

A deterministic fixture for receipt-first research context.

Zain Dana Harper/ Seattle · 2026/ draft · not archive-submitted/ research index

Status

This page is website copy for a working paper draft. The benchmark result is HYPHAL_CONTEXT_FIXTURE_MATCH for one deterministic fixture only. It compares a full-context route against a gradient-envelope-and-receipt route over the frozen biology/network-intelligence corpus. Model answer quality is NOT_MEASURED. General route superiority is UNVERIFIABLE. A native BuildLang/buildc receipt is NOT_REPLAYED.

Claim ledger

Benchmark fixture
HYPHAL_CONTEXT_FIXTURE_MATCH. The hyphal route recovered the same required evidence classes and guardrails as the full-context route with fewer estimated prompt tokens.
Full-context route
MATCH. Sends all ten source bodies plus the source gate and seed note. Estimated prompt tokens: 123413.
Hyphal route
MATCH. Sends ten gradient envelopes and rehydrates six evidence cards by receipt. Estimated prompt tokens: 1338.
Guardrails
MATCH. Both routes block biological nervous-system equivalence, universal intentional common-mycorrhizal-network messaging, and overpromoting the protocol beyond the fixture.
Tooling receipts
MATCH / VERIFIED. Crucible assessed the bounded thesis as MATCH 3 / DRIFT 0 / UNVERIFIABLE 0. Learn reverified the prooflesson receipt as VERIFIED.
Scope
UNVERIFIABLE outside fixture. The result does not prove that hyphal routing wins generally or that a model answer improves.

Measured fixture

Token savings
122075 estimated prompt tokens.
Token savings ratio
0.9892 of the full-context estimate.
Evidence recall delta
0 across the six required evidence classes.
Guardrail delta
0 across the three bounded guardrails.

Required evidence

  • fungal_signal
  • plant_signal
  • network_evidence
  • overclaim_boundary
  • source_availability
  • architecture_seed

Benchmark relation

hyphal.required_class_recall == full.required_class_recall
hyphal.guardrail_recall == full.guardrail_recall
hyphal.estimated_prompt_tokens < full.estimated_prompt_tokens

The current receipt verifies this relation in JavaScript over one frozen corpus. The next version should replay the relation in BuildLang/buildc and add a model answer-quality phase.

Receipts

  • Benchmark CLI: demo/hyphal-context-benchmark.mjs.
  • Benchmark receipt: hyphal-context-benchmark-2026-07-02.json.
  • Crucible run: twenty-second-wave-hyphal-context-run-2026-07-02.json.
  • Crucible report: twenty-second-wave-hyphal-context-report-2026-07-02.md.
  • Learn prooflesson: twenty-second-wave-hyphal-context-benchmark.prooflesson.json.
  • Official copy: HYPHAL-CONTEXT-BENCHMARK-FOR-RECEIPT-ROUTING-2026-07-02.md.

Do not infer

  • Do not infer that this fixture is a general context-routing theorem.
  • Do not infer that estimated prompt tokens equal provider billing tokens.
  • Do not infer that model answer quality has been measured here.
  • Do not infer that the benchmark changes the biology source boundaries.
  • Do not infer that this relation already has a native BuildLang/buildc receipt.

Local source draft: docs/research/whitepapers/HYPHAL-CONTEXT-BENCHMARK-FOR-RECEIPT-ROUTING-2026-07-02.md. Current page status: draft website copy. Updated 2026-07-02.