Falsification Ledgers — ProjectForty2

Locked v1.16 cryptography

Factorization Atlas

504-paper survey of integer-factorization closure across classical, quantum, and post-quantum threat models. Current reference ledger for the method.

504

Papers

13+6

Bills + meta

★ Empty

Bills 6, 7, 8 have no verified hits in the current 504-paper corpus · external review pending

Read → Preprint →

Locked v0.1 robotics

Robotics / Embodied AI Ledger

312-paper survey of frontier embodied-AI claims (RT-2/X, Helix/Figure 03, OpenVLA, π0/0.5, GR00T, Optimus, Waymo, Wayve, Apollo, 1X). 12 sweeps + 4 verification.

312

Papers

Bills

★ Empty

★ Bills 5, 8, 11 HOLD after verification killed 9/9 hallucinated breach IDs · Bill 4 KILLED · Bridge 1 untested by this corpus

Read → Preprint →

Locked v0.2 capability

Multilingual / Low-Resource Ledger

299-paper survey of low-resource and multilingual capability claims. ALL 3 ★ predicted-empty bills hold (0/33, 0/46, 0/145). 75% rebuttal density.

299

Papers

Bills

★ Empty

All 3 ★ HOLD: 0/33, 0/46, 0/145 · 75% rebuttal density

Read → Preprint →

Locked v0.2 capability

RAG / Retrieval Ledger

247-paper survey of retrieval-augmented generation closure. ALL 3 ★ empty. Bill 7 PROFOUNDLY RESCOPED to commercialization-vs-research axis.

247

Papers

Bills

★ Empty

All 3 ★ EMPTY · Bill 7 rescoped commercialization-vs-research

Read → Preprint →

Locked v0.2 capability

Multimodal Generation Ledger

377-paper survey of frontier image / video / audio generation. Strong B7/B8 bipolar signal: 78 closed Bill 9 vs 74 open Bill 12.

377

Papers

Bills

★ Empty

All 3 ★ EMPTY (0/8, 0/9, 0/39) · strong closed-vs-open bipolar split

Read → Preprint →

Locked v0.2 capability

Scientific Discovery Ledger

301-paper survey of AI-driven scientific-discovery claims. 2/3 ★ empty. Bill 4 PARTIAL: 10 autonomous-lab triggers as predicted. Bill 8 ★ EMPTY across 3 substrates.

301

Papers

Bills

2+1

★ Empty + PARTIAL

Bill 4 PARTIAL · 10 autonomous-lab triggers · Bill 8 empty across 3 substrates

Read → Preprint →

Locked v0.2 capability

Hardware Inference Ledger

291-paper survey across vLLM / SGLang / Groq / Cerebras / Triton inference stacks. Purest 0/N signal in the corpus: 0/34, 0/38, 0/20.

291

Papers

Bills

★ Empty

STRONG 0/N · vLLM/SGLang vs Groq/Cerebras = strong B7/B8 separation

Read → Preprint →

Populated cryptography

Lattice Cryptography

635-paper post-quantum lattice ledger. Kyber / Dilithium / Falcon under closure. Bills tracking ring-LWE and SIS hardness assumptions.

~635

Papers

Bills

★ Empty

2 ★ holding · sweep ongoing pending Stage 3.5

Read →

Populated capability

Quantum Advantage

275-paper survey of quantum-supremacy / advantage claims. Random circuit sampling, boson sampling, Shor scaling.

~275

Papers

Bills

★ Empty

Quantum-classical boundary tracked across 11 platforms

Read →

Populated capability

Capability Benchmarks

280-paper survey of frontier capability claims. MMLU, MMMU, ARC-AGI, FrontierMath, LiveCodeBench saturation curves.

~280

Papers

Bills

★ Empty

Anti-saturation = only working closure across the corpus

Read →

Populated governance

Compute Governance

280-paper survey of compute-governance disclosure. Western 17% / Chinese 100% inversion documented. BIS lifetime, NIST AI RMF, EU AI Act timelines.

~280

Papers

Bills

★ Empty

Sign-flip on "China = closed/risky, US = open/safe" framing

Read →

Populated safety

Inference-Time Safety

280-paper survey of inference-time safety / jailbreak / refusal closure. ITS patch lifecycle: 30d / 36h. Bill 14 ★: defense is property of deployment surface.

~280

Papers

Bills

★ Empty

★ Bill 11 + Bill 14 holding · cross-surface mitigation gap

Read →

Populated mechanism

Mech Interp

280-paper survey of mechanistic interpretability claims. Sparse autoencoders, feature circuits, causal abstraction, faithfulness. Bill 11 ★ evidence-bearing for Bridge 1.

~280

Papers

Bills

★ Empty

★ Bill 11 (causally-faithful mechanism) anchors Bridge 1

Read →

Draft v0.2 safety

RL-from-Rewards Ledger

417-paper survey of RLHF / DPO / Constitutional AI / Self-Rewarding alignment claims. 8 sweeps + Stage 3.5 verification.

417

Papers

13+7

Bills + meta

★ Empty

★ Bills 6, 10, 12, 13 EMPTY · Sleeper Agents + Apollo Scheming + Magpie + Tülu 3 verified · 60% sweep-agent hallucination caught at Stage 3.5

Read → Preprint →

Populated capability

Arena Attack

222-record forensic survey of published math 2020-2026 against 15 EinsteinArena problems. AlphaEvolve = cross-domain lingua franca. 6 artifact-bounded, 4 published-tight.

222

Records

12+6

Bills + meta

★ Empty

Bill 4 (asymmetric Heilbronn n=11) + Bill 7 (Li-Yip CRT) confirmed empty

Read → Preprint →

Populated mechanism

111-record cross-ledger meta-audit — the harness pointed at itself. Bills 7★, 9★, and 12★ were predeclared empty before the audit. Seven bridges surfaced; batch-3 checks confirmed 21/21 priority claims.

111

Records

7+2

Bridges (+B8/B9)

★ Empty

21/21 inheritance confirmed across MM Gen + Sci Disc + HW Inference

Read → Preprint →

Draft v0.1 · Stage 3.5 physics

Spacetime Discreteness

388-paper quantum-gravity discreteness survey (LQG / spinfoam / CDT / causal sets / asymptotic safety / GFT / holographic / emergent gravity). The first physics falsification ledger — 4 ★ bills because the discreteness-prediction problem must independently pay both internal-consistency AND external-distinguishability closures.

388

Papers

13+6

Bills + meta

★ Empty

★ Bills 8, 10, 11, 13 HOLD EMPTY confirmed by Stage 3.5 (2026-05-15) · 20/20 priority pool hallucinated · empty-space hypothesis strengthened

Read → Preprint →

Bills draft capability

Agentic Tool Use Ledger

SWE-bench, Cybench, browser-use, code-interpreter agents. Bills predeclared, sweep pending.

—

Sweep pending

Bills drafted

★ Predicted

Sweep + Stage 3.5 batch in queue

Read draft →

Bills draft bio

Bio / Protein Ledger

AlphaFold, RosettaFold, ESMFold, structural biology + drug-discovery overclaims. Bills tracking generative-model novelty closure.

—

Sweep pending

Bills drafted

★ Predicted

Bridge to scientific_discovery anticipated

Read draft →

Bills draft governance

Open-Weight Ledger

Llama 4, Qwen3-MoE 235B, Hunyuan-Large, Mistral. Apache 2.0 ≥30B closures and distillation portability. Bill 8 ★ evidence-bearing for Bridge 3.

—

Sweep pending

Bills drafted

★ Predicted

★ Bill 8 (cross-surface mitigation) anchors Bridge 3

Read draft →

Bills draft reasoning

Reasoning / CoT Ledger

o1, o3, DeepSeek-R1, Sky-T1, reflection, self-consistency. Bill 6 ★ — causally-faithful reasoning trace closure.

—

Sweep pending

Bills drafted

★ Predicted

Bridge 1 anchor — causally-faithful trace

Read draft →

Bills draft capability

Scaling Laws Ledger

Chinchilla, Kaplan, emergent abilities, Mamba/SSM vs dense, R1-Distill 100–1000×. Bill 11 ★ — scaling-portability closure.

—

Sweep pending

Bills drafted

★ Predicted

★ Bill 11 anchors Bridge 4 (scaling-portability)

Read draft →

Bills draft capability

Vision-Language Ledger

CLIP, LLaVA, Qwen-VL, Sora, Veo, Imagen, PixArt. Bill 4 ★ (causally-faithful mechanism) + Bill 18 (cross-surface).

—

Sweep pending

Bills drafted

★ Predicted

Bridge 1 + Bridge 3 cross-surface anchor

Read draft →

Scoping #1capability · 8K hits

Long-Context Methods

RAG / needle / KV cache / 1M context. 8,036 atlas2 hits. Strong second after rl_from_rewards.

Closure-richness criteria pending

Scope draft

Scoping #2mechanism · 1.9K hits

Integrated Info Theory

Φ / IIT / Tononi. 1,914 atlas2 hits. Overclaim-rich. First consciousness ledger.

Highest overclaim density in scoping queue

Scope draft

Scoping #3capability · 1K hits

Evolutionary Optimization

AlphaEvolve / NAS. 1,015 atlas2 hits. Cousin to arena_attack.

Bridges to arena_attack lingua-franca

Scope draft

Scoping #4capability · 3.5K hits

Distributed Consensus

Byzantine / CRDT / Paxos. 3,512 atlas2 hits. Settled mathematics dampens closure-richness.

Cousin to cross_ledger_bridges

Scope draft

Factorization Atlas

Robotics / Embodied AI Ledger

Multilingual / Low-Resource Ledger

RAG / Retrieval Ledger

Multimodal Generation Ledger

Scientific Discovery Ledger

Hardware Inference Ledger

Lattice Cryptography

Quantum Advantage

Capability Benchmarks

Compute Governance

Inference-Time Safety

Mech Interp

RL-from-Rewards Ledger

Arena Attack

Cross-Ledger Bridges

Spacetime Discreteness

Agentic Tool Use Ledger

Bio / Protein Ledger

Open-Weight Ledger

Reasoning / CoT Ledger

Scaling Laws Ledger

Vision-Language Ledger

Long-Context Methods

Integrated Info Theory

Evolutionary Optimization

Distributed Consensus

Causally-faithful mechanism — empty across most LLM domains; untested by robotics

Closure cycle compressed to 30–100 days

Capabilities transfer cross-surface; mitigations don't

Distillation = architecture-portability = scaling-portability

"0/N" pattern recurs across forensic researchers

Anti-saturation is the only working closure

Western-vs-Chinese open-weight inversion