
Biology / Protein Folding Ledger: Data Receipts

Public Draft v0.2 REAL DATA

Eight parallel Opus research-agent sweeps yielded ~322 raw papers, which were deduplicated and hand-arbitrated down to 283 unique papers. Bills 4, 7, and 10 are marked ★ NO CLEAN TRIGGER YET (0 clean triggers each). Rebuttal density is 13.8%.

Receipts

| Artifact | Link | Purpose |
|---|---|---|
| Bill definitions | bills_draft.md | 13 bills + 6 meta-costs + 3 escape gates + ★ 4, 7, 10 empty-space verification with real fire counts |
| Threat model | purpose.md | Threat model, scope, empty-space hypothesis, cousin-ledger coupling |
| Corpus union JSON | _batch_1_union.json | 283 unique papers (deduplicated from ~322 raw across 8 sweeps), with full metadata |
| Classifier | bill_classifier.py | Regex rule engine + hand-arbitration. Run with --arbitrate-union |
| Aggregator | aggregate_batch_1.py | Deduplicates raw sweep JSONs into the corpus union |
| README | README.md | Reproducibility README with run order |
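
The aggregator entry above describes deduplicating raw sweep JSONs into a corpus union. The following is a minimal sketch of that kind of merge, not the contents of aggregate_batch_1.py. It assumes each sweep is a list of paper records with hypothetical `title` and `doi` fields, and it uses a hypothetical matching rule (DOI first, normalized title as fallback). The real schema and rules may differ, and the hand-arbitration step is not modeled here.

```python
import re
from typing import Iterable


def paper_key(paper: dict) -> str:
    """Hypothetical dedup key: lowercase DOI if present, else a normalized title."""
    doi = (paper.get("doi") or "").strip().lower()
    if doi:
        return "doi:" + doi
    # Collapse every run of non-alphanumeric characters to a single space.
    title = re.sub(r"[^a-z0-9]+", " ", (paper.get("title") or "").lower()).strip()
    return "title:" + title


def union_sweeps(sweeps: Iterable[list[dict]]) -> list[dict]:
    """Merge sweeps, keeping the first record seen for each key."""
    seen: dict[str, dict] = {}
    for sweep in sweeps:
        for paper in sweep:
            seen.setdefault(paper_key(paper), paper)
    return list(seen.values())


# Toy records for illustration only (not taken from the corpus).
sweep_a = [{"title": "Example Paper One", "doi": "10.0000/example.1"}]
sweep_b = [
    {"title": "Example paper one.", "doi": "10.0000/EXAMPLE.1"},  # same DOI, different case
    {"title": "Example Paper Two", "doi": None},
]
union = union_sweeps([sweep_a, sweep_b])
print(len(union))  # 2
```

Keeping the first record seen is one arbitrary tie-break; a pipeline with hand-arbitration would presumably resolve conflicting metadata by review instead.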

Real fire counts

| Bill | Cands. | Clean | Rebuttals | Gated |
|---|---|---|---|---|
| 1 — PDB contamination audit | 22 | 22 | 0 | 0 |
| 2 — Sequence-similarity leakage | 5 | 5 | 0 | 0 |
| 3 — Designable-target audit | 15 | 15 | 0 | 0 |
| 4 ★ Causally-faithful structure-prediction mechanism | 20 | 0 | 0 | 20 |
| 5 — Cross-fold-method generalization | 12 | 12 | 0 | 0 |
| 6 — Disordered-region / IDR audit | 5 | 5 | 0 | 0 |
| 7 ★ Cross-organism / cross-fold-class generalization | 13 | 0 | 0 | 13 |
| 8 — Functional-assay validation | 4 | 4 | 0 | 0 |
| 9 — Held-out post-2024 PDB target set | 4 | 4 | 0 | 0 |
| 10 ★ Wet-lab independent reproduction | 49 | 0 | 34 | 15 |
| 11 — Dual-use synthesis-screening audit | 45 | 44 | 1 | 0 |
| 12 — Vendor-self-eval independence | 5 | 5 | 0 | 0 |
| 13 — Test-time-search / co-evolution-search decomposition | 4 | 4 | 0 | 0 |
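
The fire counts above can be sanity-checked in a few lines. The sketch below uses rows transcribed by hand from the fire-count listing. It lists the bills with zero clean triggers and checks that candidates equal clean + rebuttals + gated in every row. That identity is an observation about the published numbers, not a definition stated in this draft, and the variable names are illustrative.

```python
# (bill, candidates, clean, rebuttals, gated), transcribed from the fire counts above
FIRE_COUNTS = [
    (1, 22, 22, 0, 0),
    (2, 5, 5, 0, 0),
    (3, 15, 15, 0, 0),
    (4, 20, 0, 0, 20),
    (5, 12, 12, 0, 0),
    (6, 5, 5, 0, 0),
    (7, 13, 0, 0, 13),
    (8, 4, 4, 0, 0),
    (9, 4, 4, 0, 0),
    (10, 49, 0, 34, 15),
    (11, 45, 44, 1, 0),
    (12, 5, 5, 0, 0),
    (13, 4, 4, 0, 0),
]

# Bills with no clean trigger (the starred rows).
no_clean = [bill for bill, _, clean, _, _ in FIRE_COUNTS if clean == 0]

# Rows where the three outcome columns do not add up to the candidate count.
# Assumption: every candidate lands in exactly one outcome column.
inconsistent = [
    bill
    for bill, cands, clean, rebuttals, gated in FIRE_COUNTS
    if cands != clean + rebuttals + gated
]

print(no_clean)      # [4, 7, 10]
print(inconsistent)  # []
```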

Public draft v0.2 (2026-05-09). Sweep JSONs live in the source repo, in the ProjectForty2 public evidence bundle under bio_protein/deep_loops/. The v0.3 lock is targeted for 2026-Q3.