The Negative Alignment Tax

What if structured safety improved selected capability metrics?

In the runs reported here, safety-oriented adapters improved both selected safety and capability measurements. The result is scoped to these models, tasks, and metrics; external replication is still the next bar.

Results

Metric Result
Safety improvement +2.3%
Capability improvement +2.3%
Deception detection accuracy 92.9%
Drift detection signal ratio 652×
Architectures validated 6+
Tests passed 261
Not a tradeoff. A compound effect. Both metrics improve simultaneously.

What We Built

The Cognitive Kernel

Composable epistemic adapters designed to steer recurring reasoning patterns across tested open-weight transformer families.

CHRONOS

39 MCP tools for real-time AI behavior analysis.

Proof Points

2
Open-Weight Models
39
MCP Tools
261
Tests Passed

Models: ProjectForty2/dont_panic (large open-weights base), ProjectForty2/ford_prefect (small open-weights base) on HuggingFace
Validation: Full MMLU (570 questions), TruthfulQA (+33% hallucination resistance), cross-architecture replication

Engagement Options

Acquisition

Full IP transfer with continued development inside a frontier lab

Research Partnership

Joint development with attribution and shared findings

Licensing

Methodology access for internal deployment and research