Show & Tell: Fixing the Safety Module — From Auto-Prove to Sound Logic #26

web3guru888 · 2026-04-11T16:49:36Z

web3guru888
Apr 11, 2026
Maintainer

The Problem

Issue #7: the formal_verification module was auto-proving every ethical constraint regardless of whether the premises actually entailed the conclusion. A safety system that certifies everything as safe is worse than no safety system at all.

We traced 5 distinct root causes and fixed all of them in ce0e3f0.

Root Cause Breakdown

1. Bare `except` → Opaque Symbol Wrapping

The original `_parse_formula` used a bare `except` clause. On any parse failure it returned `sp.Symbol(formula)`, wrapping the entire formula string as a single SymPy symbol. If premise and conclusion were the same string, they became the same symbol, so premise → conclusion was trivially true.

Fix: Replaced with a shared `parse_logic_formula()` that raises `FormulaParseError` on failure. No silent degradation.
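
To make the difference concrete, here is a minimal sketch of the two behaviours. It is not the code from ce0e3f0: Python's built-in `ast` parser stands in for the real SymPy-based parser, and `lenient_parse` and `strict_parse` are hypothetical names. Only `FormulaParseError` is taken from the fix itself.

```python
import ast


class FormulaParseError(ValueError):
    """Raised when a formula string cannot be parsed."""


def lenient_parse(formula: str):
    """Reproduces the failure mode: any error turns the string into one atom."""
    try:
        return ast.dump(ast.parse(formula, mode="eval"))
    except:  # noqa: E722 (deliberately bare, to mirror the original bug)
        return ("atom", formula)


def strict_parse(formula: str):
    """Fail-loud variant: a parse error is reported, never papered over."""
    try:
        return ast.dump(ast.parse(formula, mode="eval"))
    except SyntaxError as exc:
        raise FormulaParseError(f"cannot parse {formula!r}") from exc


# "A -> B" is not valid expression syntax for this stand-in parser.
premise = lenient_parse("A -> B")
conclusion = lenient_parse("A -> B")
print(premise == conclusion)  # both are the same opaque atom

try:
    strict_parse("A -> B")
except FormulaParseError as err:
    print("rejected:", err)
```

With the lenient version, two identical unparseable strings collapse into the same atom, which is how the trivial proof arose. The strict version hands the error back to the caller instead.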

2. All 8 Ethical Axioms Parsed to `sp.true`

`EthicalAxiom._parse_formula` was missing the `'->'` → `'>>'` operator replacement. SymPy uses `>>` for implication; without the substitution, `A -> B` failed to parse and became `sp.true`. All eight axioms were literally true and added zero constraint.

Fix: `EthicalAxiom` now delegates to the shared `parse_logic_formula()`, which applies the correct operator replacement.
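
A rough sketch of what the operator replacement looks like with SymPy's own parser. `parse_axiom` is a hypothetical helper, not the project's `parse_logic_formula()`, and the free-symbol guard is an assumption based on the axiom-parsing tests listed further down.

```python
import sympy as sp
from sympy.parsing.sympy_parser import parse_expr


def parse_axiom(formula: str) -> sp.Basic:
    """Parse an axiom string, mapping '->' to SymPy's '>>' (Implies)."""
    normalized = formula.replace("->", ">>")
    expr = parse_expr(normalized)
    # Guard (assumed): an axiom with no free symbols constrains nothing.
    if not expr.free_symbols:
        raise ValueError(f"axiom {formula!r} has no free symbols")
    return expr


axiom = parse_axiom("causes_harm -> ~permitted")
print(axiom, axiom.free_symbols)

# Caveat: in Python's grammar '>>' binds tighter than '&' and '|',
# so compound antecedents need explicit parentheses.
print(parse_axiom("(A & B) -> C"))  # implication with a conjunctive antecedent
print(parse_axiom("A & B -> C"))    # parsed as A & (B >> C), probably not intended
```

The precedence caveat is worth a test of its own: a plain string replacement keeps Python's operator precedence, so unparenthesized axioms can silently change meaning.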

3. Ungrounded Conclusion Symbols

The prover was checking premises ⊢ `satisfies_safety_constraint_xyz`, where `satisfies_safety_constraint_xyz` is a fresh symbol with no logical connection to the premises. Because nothing constrains a fresh symbol, a satisfying assignment always exists, so the check always succeeded.

Fix: The prover now uses the constraint's actual `formal_specification` (e.g. `~causes_harm`) as the conclusion.
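
The gap between "satisfiable" and "entailed" can be sketched as follows, assuming the old check amounted to a plain satisfiability test. `entails` and the symbol `action_is_logged` are hypothetical; `satisfiable` is SymPy's standard SAT entry point.

```python
from sympy import And, Implies, Not, symbols
from sympy.logic.inference import satisfiable


def entails(premises, conclusion) -> bool:
    """premises entail conclusion iff premises AND NOT conclusion is unsatisfiable."""
    return not satisfiable(And(*premises, Not(conclusion)))


action_is_logged, causes_harm = symbols("action_is_logged causes_harm")
fresh = symbols("satisfies_safety_constraint_xyz")

premises = [action_is_logged]

# Satisfiability with a fresh symbol: a model always exists (set it to True).
buggy_verdict = bool(satisfiable(And(*premises, fresh)))

# Entailment of the real specification: fails, the premises say nothing about harm.
sound_verdict = entails(premises, Not(causes_harm))

# Entailment succeeds only when the premises actually connect to the conclusion.
grounded = entails(
    [Implies(action_is_logged, Not(causes_harm)), action_is_logged],
    Not(causes_harm),
)

print(buggy_verdict, sound_verdict, grounded)
```

The refutation form (premises plus the negated conclusion must be unsatisfiable) is the standard way to turn a SAT solver into an entailment check.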

4. Model Checking Only Checked 2 Models

`_prove_by_model_checking` checked only two assignments: {all vars = True} and {all vars = False}. Any formula that happens to hold under those two assignments but fails under a mixed assignment was spuriously proven.

Fix: Now exhaustively enumerates all 2^n truth assignments via `itertools.product`. Falls back to SymPy's SAT solver for n > 20 variables.
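
A library-free sketch of why two models are not enough. Formulas are modelled as plain Python callables over an assignment dict, which is an assumption made for brevity; the real checker works on SymPy expressions. `two_model_check` and `find_counterexample` are hypothetical names.

```python
from itertools import product
from typing import Callable, Dict, Optional, Sequence

Assignment = Dict[str, bool]
Formula = Callable[[Assignment], bool]


def two_model_check(formula: Formula, variables: Sequence[str]) -> bool:
    """The old behaviour: look only at all-True and all-False."""
    all_true = {v: True for v in variables}
    all_false = {v: False for v in variables}
    return formula(all_true) and formula(all_false)


def find_counterexample(formula: Formula, variables: Sequence[str]) -> Optional[Assignment]:
    """Enumerate all 2^n assignments; return the first falsifying one, if any."""
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if not formula(assignment):
            return assignment
    return None


def a_implies_b(m: Assignment) -> bool:
    # "A -> B" claimed as a theorem; it is not valid.
    return (not m["A"]) or m["B"]


print(two_model_check(a_implies_b, ["A", "B"]))      # True: spuriously "proven"
print(find_counterexample(a_implies_b, ["A", "B"]))  # {'A': True, 'B': False}
```

At n = 20 the exhaustive loop already visits 2^20 = 1,048,576 assignments, which is why a cutoff with a solver fallback is needed.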

5. String-Template Natural Deduction

The natural deduction engine matched rules via fragile string patterns like `[['A', 'A -> B'], 'B']`. Any formatting difference broke the match.

Fix: Symbolic forward chaining using SymPy `Implies` pattern matching (modus ponens, modus tollens, hypothetical syllogism), working on the actual expression tree.
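
The three rules can be sketched as structural forward chaining. In this illustration plain tuples stand in for SymPy `Implies` and `Not` nodes, and `forward_chain` is a hypothetical name; the point is only that rules match on structure rather than on formatted strings.

```python
from typing import Set, Tuple, Union

# An atom is a str; compound formulas are ("not", f) or ("implies", p, q).
Formula = Union[str, Tuple]


def neg(f: Formula) -> Formula:
    return ("not", f)


def forward_chain(premises: Set[Formula], max_rounds: int = 10) -> Set[Formula]:
    """Close the premises under modus ponens, modus tollens and hypothetical syllogism."""
    known = set(premises)
    for _round in range(max_rounds):
        new = set()
        implications = [f for f in known if isinstance(f, tuple) and f[0] == "implies"]
        for _, p, q in implications:
            if p in known:                      # modus ponens: p, p -> q gives q
                new.add(q)
            if neg(q) in known:                 # modus tollens: ~q, p -> q gives ~p
                new.add(neg(p))
            for _, p2, q2 in implications:      # syllogism: p -> q, q -> r gives p -> r
                if q == p2:
                    new.add(("implies", p, q2))
        if new <= known:                        # fixed point reached
            break
        known |= new
    return known


derived = forward_chain({"A", ("implies", "A", "B"), ("implies", "B", "C")})
print("C" in derived)

# Affirming the consequent must NOT derive the antecedent.
fallacy = forward_chain({"B", ("implies", "A", "B")})
print("A" in fallacy)
```

Because matching is on tuple structure, whitespace or formatting in the original formula text cannot affect which rules fire.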

72 New Tests Added

parse_logic_formula: operators, quantifiers, registry, error handling (19 tests)
Auto-prove blocking: ungrounded, unrelated, empty premises (8 tests)
Valid entailments: modus ponens/tollens, syllogisms, contrapositive (12 tests)
Invalid fallacies: affirming consequent, denying antecedent (7 tests)
Contradiction handling: ex falso blocked (3 tests)
Axiom parsing: not sp.true, has free symbols (4 tests)
Model checking: exhaustive enumeration, counterexample detection (4 tests)
Ethical engine: harmful rejected, safe accepted (8 tests)
Natural deduction symbolic chains (5 tests)

Current Status

With both IIT Φ (693742e) and formal verification (ce0e3f0) fixed:

3,236 passed · 25 skipped · 0 failed

The two most dangerous known bugs are resolved. The homomorphic module (Issue #8) is next.

Open Question for Contributors

The model checker currently falls back to SymPy's SAT solver above n = 20 variables. For a production safety system we have two paths:

Z3 SMT solver — much faster, handles quantified formulas, excellent Python bindings (pip install z3-solver)
Lean 4 / Coq — proof certificates, machine-checkable, but steep integration cost

Which direction makes more sense for ASI:BUILD's use case? Comments welcome — especially if you've worked with either backend.

Code: formal_verification.py · test_formal_verification_fix.py
