All data

Scaling Laws Ledger Data Receipts

Public Draft v0.2 REAL DATA

8 parallel Opus research-agent sweeps yielded ~401 raw papers, deduplicated and hand-arbitrated to 302 unique. Bills 5, 8, 11 ★ NO CLEAN TRIGGER YET (0 clean triggers each). Rebuttal density 25.8%.

Receipts

ArtifactLinkPurpose
Bill definitionsbills_draft.md13 bills + 6 meta-costs + 3 escape gates + ★ 5, 8, 11 empty-space verification with real fire counts
Threat modelpurpose.mdThreat model, scope, empty-space hypothesis, cousin-ledger coupling
Corpus union JSON_batch_1_union.json302 unique papers (deduplicated from ~401 raw across 8 sweeps), with full metadata
Classifierbill_classifier.pyRegex rule engine + hand-arbitration. Run with --arbitrate-union
Aggregatoraggregate_batch_1.pyDeduplicates raw sweep JSONs into the corpus union
READMEREADME.mdReproducibility README with run order

Real fire counts

BillCands.CleanRebuttalsGated
1 — Data-mixture conditioning audit504451
2 — Tokenizer-drift / vocab-size audit352033
3 — Cross-architecture replication725691
4 — Inverse-scaling subset audit5320
5 ★ Causally-faithful scaling-law mechanism190190
6 — Test-time-compute decomposition5500
7 — Hyperparameter-transfer audit261025
8 ★ Cross-data-mixture generalization12075
9 — Vendor-claim half-life / temporal-trajectory audit121110
10 — Emergence-as-mirage decomposition1010
11 ★ Universal scaling-law cross-architecture590336
12 — Anti-saturation construction1100
13 — Distilled-cousin reproduction3300

Public draft v0.2 (2026-05-09). Sweep JSONs live in the source repo at ProjectForty2 public evidence bundle: scaling_laws/deep_loops/. Target v0.3 lock 2026-Q3.