Section 06: Expand Fixtures + Self-Test

Status: Not Started Goal: The diagnostic toolkit’s self-test suite runs against only 3 basic fixtures (simple.ori, clean.ori, chain.ori). These don’t exercise closures, iterators, nested structures, generics, trait dispatch, or failure modes — the exact code patterns that cause the most debugging churn. New fixtures ensure diagnostic scripts produce correct output for the patterns they’ll actually be used to debug. The fixture suite must also cover escape closures, ? unwinding, recursive tree walks, COW sharing, large aggregates, and mixed sum types — all identified as blind spots by tp-help consensus.

Success Criteria:

At least 13 new .ori fixture files in diagnostics/fixtures/ — 14 new fixtures created
Each fixture exercises a distinct code pattern relevant to AOT/AIMS debugging
Fixtures categorized as pass (exit 0, clean RC), aims-heavy (exit 0, exercises AIMS-specific paths like COW/reuse), or expected-fail (exit non-zero, validates diagnostic detection)
self-test.sh runs new fixtures through diagnose-aot.sh, dual-exec-debug.sh, rc-stats.sh, ir-dump.sh, arc-dump.sh, and bisect-passes.sh (added by Section 05)
bisect-passes.sh exercised on at minimum closure.ori, iterator_break.ori, and generic_mono.ori (the AIMS-relevant fixtures) — all 13 pass/aims-heavy fixtures run through bisect-passes
Self-test assertions are feature-specific — not just “non-empty output” but assertions on expected IR markers (e.g., PartialApply for closures, Switch for match, RcInc/RcDec for RC-heavy fixtures)
Expected-fail fixtures use run_test_expect_fail with explicit exit code assertions distinguishing leak vs crash vs mismatch
All fixtures verified under both debug and release builds (cargo b and cargo b --release)
Satisfies mission criterion: “7+ new diagnostic fixtures covering closures, iterators, nested structures, generics, trait dispatch, and failure modes”

Context: The current 3 fixtures (simple.ori — no collections/RC; clean.ori — collections, balanced RC; chain.ori — chained COW) were adequate when the toolkit was first built. But ARC/AIMS bugs predominantly appear in closure captures, iterator early-exit cleanup, nested aggregate drops, generic instantiation, and trait method dispatch — none of which are exercised. A diagnostic regression in these areas ships behind a green self-test.

Depends on: Section 05 (bisect-passes.sh must exist for self-test integration).

README ownership: Section 07 owns the diagnostics/README.md fixtures table update (see section-07-integration.md 07.4). This section creates the fixtures and the FIXTURES.md categorization file; Section 07 integrates the final table into the user-facing README.

06.1 Create core-pattern fixtures

File(s): diagnostics/fixtures/*.ori (new files)

Each fixture must: (1) compile under AOT, (2) produce deterministic output via exit code (0 = success, 1 = logic failure), (3) exercise a specific code pattern, (4) pass both ori run and AOT binary execution with identical results. Fixture names are descriptive of the pattern, not the section number. Reference existing test files in tests/valgrind/fat_matrix/ for correct Ori syntax patterns.

Category: pass — all exit 0, balanced RC.

06.2 Create ARC-interaction fixtures

File(s): diagnostics/fixtures/*.ori (new files)

These fixtures exercise ARC-specific interaction patterns that tp-help identified as blind spots. They are pass fixtures (exit 0) but are categorized as aims-heavy because they specifically stress AIMS pipeline phases.

Category: aims-heavy — all exit 0, but exercise AIMS-specific paths (COW, reuse, ? unwinding, recursion).

06.3 Create expected-fail fixtures

File(s): diagnostics/fixtures/*.ori (new files)

tp-help identified that failure fixtures were “optional and underspecified” — this is a coverage gap. Diagnostic scripts must be validated in failure mode, not just success mode. These fixtures are mandatory.

Category: expected-fail — designed to trigger specific diagnostic failures.

leak.ori — Program that intentionally leaks an RC value (e.g., create a circular reference or allocate without drop path). ORI_CHECK_LEAKS=1 must report a leak. diagnose-aot.sh must detect the leak. This validates that the leak detection path in diagnostic scripts actually works.
- Safe Ori code cannot create true RC leaks (no circular references, ARC manages all allocations). Created best-effort fixture: panic with fat values in scope causes diagnose-aot.sh to report FAIL (execution exit=1) + WARN (RC Stats imbalanced: over-releases from incomplete cleanup). ORI_CHECK_LEAKS=1 does not report leaks because the panic handler bypasses ori_run_main’s return path where the leak check runs.
mismatch_compute.ori — Program that (via the mismatch-wrapper.sh infrastructure already in diagnostics/fixtures/) produces different interpreter vs AOT output. This validates that dual-exec-debug.sh correctly detects and reports mismatches with auto-diagnostic output. Note: The existing mismatch.ori + mismatch-wrapper.sh already serves this purpose — verify it is sufficient or extend it.
- Verified: existing mismatch.ori + mismatch-wrapper.sh is sufficient. ORI_BIN=mismatch-wrapper.sh dual-exec-debug.sh mismatch.ori correctly detects MISMATCH (stdout “INTERP” vs “AOT”), exits 1, and produces auto-diagnostic output. No separate mismatch_compute.ori needed.
Subsection close-out (06.3) — MANDATORY before starting 06.4:
- All tasks above are [x] and verified
- Update this subsection’s status in section frontmatter to complete
- Run /improve-tooling retrospectively on THIS subsection

06.4 Fixture matrix and categorization

File(s): diagnostics/fixtures/FIXTURES.md (new file)

tp-help identified scattered fixture knowledge as a LEAK — fixture names are repeated per-script in self-test with no single source of truth for what each fixture covers. This subsection creates the SSOT.

Create diagnostics/fixtures/FIXTURES.md with a categorization table: Created with 18 fixtures (11 pass, 5 aims-heavy, 2 expected-fail) plus build-fail-parse.ori and mismatch-wrapper.sh infra entries. Includes full matrix table matching the plan specification. Also added infra category for supporting infrastructure files.
In FIXTURES.md, document the self-test contract for each category:
- pass: ir-dump.sh (non-empty), arc-dump.sh (non-empty), diagnose-aot.sh (exit 0), dual-exec-debug.sh (MATCH), rc-stats.sh (produces output), bisect-passes.sh --rc-only (phase table + “Leak check: clean”)
- aims-heavy: same as pass, PLUS bisect-passes.sh --rc-only shows non-zero RC ops, AND feature-specific IR marker assertions
- expected-fail: diagnose-aot.sh / dual-exec-debug.sh must report failure, specific exit code + output pattern documented per fixture
Subsection close-out (06.4) — MANDATORY before starting 06.5:
- All tasks above are [x] and verified
- Update this subsection’s status in section frontmatter to complete
- Run /improve-tooling retrospectively on THIS subsection

06.5 Update self-test.sh coverage

File(s): diagnostics/self-test.sh

06.R Third Party Review Findings

[TPR-06-001-codex][high] section-06-fixtures.md:142 — Centralize fixture categories to remove LEAK and DRIFT. generic_mono.ori inconsistency, self-test.sh as second registry. Resolved: Fixed on 2026-04-10. Moved generic_mono.ori to 06.2 (aims-heavy), added SSOT note to 06.5 fixture list requiring FIXTURES.md cross-reference.
[TPR-06-002-codex][medium] section-06-fixtures.md:48 — Add large aggregate coverage promised by the goal. Resolved: Fixed on 2026-04-10. Added large_aggregate.ori fixture to 06.2 with >16B struct pattern and IR assertion.
[TPR-06-003-codex][medium] section-06-fixtures.md:200 — Complete expected-fail matrix with exact exit-code assertions. Resolved: Fixed on 2026-04-10. Added mismatch_compute.ori to FIXTURES.md table, replaced generic run_test_expect_fail with specific exit code + output pattern assertions.
[TPR-06-001-gemini][medium] section-06-fixtures.md:195 — Add mismatch_compute.ori to FIXTURES.md table. Resolved: Fixed on 2026-04-10. Same fix as [TPR-06-003-codex].
[TPR-06-002-gemini][low] section-06-fixtures.md:79 — Harmonize generic_mono.ori categorization. Resolved: Fixed on 2026-04-10. Same fix as [TPR-06-001-codex] — moved to 06.2 aims-heavy.
[TPR-06-003-gemini][medium] section-06-fixtures.md:214 — Use —rc-only flag for bisect-passes self-test assertions. Resolved: Fixed on 2026-04-10. Updated 06.5 to specify --rc-only flag and explain why it’s load-bearing.
[TPR-06-004-gemini][low] section-06-fixtures.md:180 — Correct bisect-passes coverage for simple.ori in SSOT table. Resolved: Fixed on 2026-04-10. Changed simple.ori bisect-passes from “No (trivial)” to “Yes”.
[TPR-06-005-gemini][medium] section-06-fixtures.md:225 — Exercise leak.ori with bisect-passes.sh to verify detection. Resolved: Fixed on 2026-04-10. Added leak.ori to bisect-passes coverage with exit 1 assertion, updated table.

Section 06: Expand Fixtures + Self-Test

06.1 Create core-pattern fixtures

06.2 Create ARC-interaction fixtures

06.3 Create expected-fail fixtures

06.4 Fixture matrix and categorization

06.5 Update self-test.sh coverage

06.R Third Party Review Findings

06.N Completion Checklist