Journey 3: “I am recursive”

Source

// Journey 3: "I am recursive"
// Slug: recursion
// Difficulty: simple
// Features: recursion, comparison, arithmetic, function_calls
// Expected: fib(10) + gcd(48, 18) = 55 + 6 = 61

@fib (n: int) -> int =
    if n <= 1 then n
    else fib(n: n - 1) + fib(n: n - 2);

@gcd (a: int, b: int) -> int =
    if b == 0 then a
    else gcd(a: b, b: a % b);

@main () -> int = {
    let f = fib(n: 10);        // = 55
    let g = gcd(a: 48, b: 18); // = 6
    f + g                      // = 61
}

Execution Results

Backend	Exit Code	Expected	Stdout	Stderr	Status
Eval	61	61	(none)	(none)	PASS
AOT	61	61	(none)	(none)	PASS

Compiler Pipeline

1. Lexer

The lexer (tokenizer) breaks raw source text into a stream of tokens — the smallest meaningful units like keywords, identifiers, operators, and literals.

Tokens: 125 | Keywords: 12 | Identifiers: 24 | Errors: 0

Token stream

Fn(@) Ident(fib) LParen Ident(n) Colon Ident(int)
RParen Arrow Ident(int) Eq If Ident(n) LtEq
Lit(1) Then Ident(n) Else Ident(fib) LParen Ident(n)
Colon Ident(n) Minus Lit(1) RParen Plus Ident(fib)
LParen Ident(n) Colon Ident(n) Minus Lit(2) RParen Semi
Fn(@) Ident(gcd) LParen Ident(a) Colon Ident(int)
Comma Ident(b) Colon Ident(int) RParen Arrow Ident(int)
Eq If Ident(b) EqEq Lit(0) Then Ident(a) Else
Ident(gcd) LParen Ident(a) Colon Ident(b) Comma Ident(b)
Colon Ident(a) Percent Ident(b) RParen Semi
Fn(@) Ident(main) LParen RParen Arrow Ident(int) Eq
LBrace Let Ident(f) Eq Ident(fib) LParen Ident(n)
Colon Lit(10) RParen Semi Let Ident(g) Eq Ident(gcd)
LParen Ident(a) Colon Lit(48) Comma Ident(b) Colon
Lit(18) RParen Semi Ident(f) Plus Ident(g) RBrace

2. Parser

The parser transforms the flat token stream into a hierarchical Abstract Syntax Tree (AST) — a tree structure that represents the grammatical structure of the program.

Nodes: 38 | Max depth: 5 | Functions: 3 | Errors: 0

AST (simplified)

Module
├─ FnDecl @fib
│  ├─ Params: (n: int)
│  ├─ Return: int
│  └─ Body: If
│       ├─ Cond: BinOp(<=)
│       │    ├─ Ident(n)
│       │    └─ Lit(1)
│       ├─ Then: Ident(n)
│       └─ Else: BinOp(+)
│            ├─ Call(@fib)
│            │    └─ n: BinOp(-)
│            │         ├─ Ident(n)
│            │         └─ Lit(1)
│            └─ Call(@fib)
│                 └─ n: BinOp(-)
│                      ├─ Ident(n)
│                      └─ Lit(2)
├─ FnDecl @gcd
│  ├─ Params: (a: int, b: int)
│  ├─ Return: int
│  └─ Body: If
│       ├─ Cond: BinOp(==)
│       │    ├─ Ident(b)
│       │    └─ Lit(0)
│       ├─ Then: Ident(a)
│       └─ Else: Call(@gcd)
│            ├─ a: Ident(b)
│            └─ b: BinOp(%)
│                 ├─ Ident(a)
│                 └─ Ident(b)
└─ FnDecl @main
   ├─ Return: int
   └─ Body: Block
        ├─ Let f = Call(@fib, n: Lit(10))
        ├─ Let g = Call(@gcd, a: Lit(48), b: Lit(18))
        └─ BinOp(+)
             ├─ Ident(f)
             └─ Ident(g)

3. Type Checker

The type checker verifies that all expressions have compatible types using Hindley-Milner type inference. It resolves type variables, checks constraints, and ensures type safety without requiring explicit type annotations everywhere.

Constraints: 14 | Types inferred: 7 | Unifications: 12 | Errors: 0

Inferred types

@fib (n: int) -> int =
    if n <= 1 then n
    //  ^ bool (Comparable<int, int> -> bool)
    else fib(n: n - 1) + fib(n: n - 2)
    //       ^ int (Sub<int, int> -> int)
    //   ^ int (return type of @fib)
    //                      ^ int (Sub<int, int> -> int)
    //                  ^ int (return type of @fib)
    // ^ int (Add<int, int> -> int)

@gcd (a: int, b: int) -> int =
    if b == 0 then a
    //  ^ bool (Eq<int, int> -> bool)
    else gcd(a: b, b: a % b)
    //               ^ int (Rem<int, int> -> int)
    //   ^ int (return type of @gcd)

@main () -> int = {
    let f: int = fib(n: 10)   // inferred: int
    let g: int = gcd(a: 48, b: 18)  // inferred: int
    f + g  // -> int (Add<int, int> -> int)
}

4. Canonicalization

The canonicalizer transforms the typed AST into a simplified canonical form. It desugars syntactic sugar, lowers complex expressions, and prepares the IR for backend consumption.

Transforms: 3 | Desugared: 0 | Errors: 0

Key transformations

- Canon nodes: 40, roots: 3, constants: 6
- Function bodies lowered to canonical expression form
- Call arguments normalized to positional order
- Named argument labels resolved to positional indices

5. ARC Pipeline

The ARC (Automatic Reference Counting) pipeline analyzes value lifetimes and inserts reference counting operations. It performs borrow inference to minimize RC overhead — parameters that are only read can be borrowed rather than owned.

RC ops inserted: 0 | Elided: 0 | Net ops: 0

ARC annotations

@fib: no heap values -- pure scalar arithmetic
@gcd: no heap values -- pure scalar arithmetic
@main: no heap values -- pure scalar arithmetic

Backend: Interpreter

The interpreter (eval path) executes the canonical IR directly, without compilation. It serves as the reference implementation for correctness testing.

Result: 61 | Status: PASS

Evaluation trace

@main()
  └─ let f = @fib(n: 10)
       └─ if 10 <= 1 → false → else
            └─ @fib(n: 9) + @fib(n: 8)
                 └─ ... (tree recursion, 177 calls total)
            └─ = 55
  └─ let g = @gcd(a: 48, b: 18)
       └─ if 18 == 0 → false → else
            └─ @gcd(a: 18, b: 48 % 18 = 12)
                 └─ @gcd(a: 12, b: 18 % 12 = 6)
                      └─ @gcd(a: 6, b: 12 % 6 = 0)
                           └─ if 0 == 0 → true → 6
  └─ f + g = 55 + 6 = 61
→ 61

Backend: LLVM Codegen

The LLVM backend compiles the canonical IR to LLVM IR, which is then compiled to native machine code via LLVM’s optimization and code generation pipeline. This path produces ahead-of-time compiled binaries.

ARC Pipeline

RC ops inserted: 0 | Elided: 0 | Net ops: 0

ARC annotations

@fib: +0 rc_inc, +0 rc_dec (no heap values)
@gcd: +0 rc_inc, +0 rc_dec (no heap values)
@main: +0 rc_inc, +0 rc_dec (no heap values)

Generated LLVM IR

; ModuleID = '03-recursion'
source_filename = "03-recursion"

@ovf.msg = private unnamed_addr constant [32 x i8] c"integer overflow on subtraction\00", align 1
@ovf.msg.1 = private unnamed_addr constant [29 x i8] c"integer overflow on addition\00", align 1

; Function Attrs: uwtable
; --- @fib ---
define fastcc noundef i64 @_ori_fib(i64 noundef %0) #0 {
bb0:
  %le = icmp sle i64 %0, 1
  br i1 %le, label %bb3, label %bb2

bb2:                                              ; preds = %bb0
  %sub = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %0, i64 1)
  %sub.val = extractvalue { i64, i1 } %sub, 0
  %sub.ovf = extractvalue { i64, i1 } %sub, 1
  br i1 %sub.ovf, label %sub.ovf_panic, label %sub.ok

bb3:                                              ; preds = %sub.ok4, %bb0
  %v1478 = phi i64 [ %add.val, %sub.ok4 ], [ %0, %bb0 ]
  ret i64 %v1478

sub.ok:                                           ; preds = %bb2
  %call = call fastcc i64 @_ori_fib(i64 %sub.val)
  %sub1 = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %0, i64 2)
  %sub.val2 = extractvalue { i64, i1 } %sub1, 0
  %sub.ovf3 = extractvalue { i64, i1 } %sub1, 1
  br i1 %sub.ovf3, label %sub.ovf_panic5, label %sub.ok4

sub.ovf_panic:                                    ; preds = %bb2
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable

sub.ok4:                                          ; preds = %sub.ok
  %call6 = call fastcc i64 @_ori_fib(i64 %sub.val2)
  %add = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %call, i64 %call6)
  %add.val = extractvalue { i64, i1 } %add, 0
  %add.ovf = extractvalue { i64, i1 } %add, 1
  br i1 %add.ovf, label %add.ovf_panic, label %bb3

sub.ovf_panic5:                                   ; preds = %sub.ok
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable

add.ovf_panic:                                    ; preds = %sub.ok4
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable
}

; Function Attrs: nounwind memory(none) uwtable
; --- @gcd ---
define fastcc noundef i64 @_ori_gcd(i64 noundef %0, i64 noundef %1) #1 {
bb3:
  br label %bb0

bb0:                                              ; preds = %bb2, %bb3
  %v12 = phi i64 [ %0, %bb3 ], [ %v13, %bb2 ]
  %v13 = phi i64 [ %1, %bb3 ], [ %rem, %bb2 ]
  %eq = icmp eq i64 %v13, 0
  br i1 %eq, label %bb1, label %bb2

bb1:                                              ; preds = %bb0
  ret i64 %v12

bb2:                                              ; preds = %bb0
  %rem = srem i64 %v12, %v13
  br label %bb0
}

; Function Attrs: uwtable
; --- @main ---
define noundef i64 @_ori_main() #0 {
bb0:
  %call = call fastcc i64 @_ori_fib(i64 10)
  %call1 = call fastcc i64 @_ori_gcd(i64 48, i64 18)
  %add = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %call, i64 %call1)
  %add.val = extractvalue { i64, i1 } %add, 0
  %add.ovf = extractvalue { i64, i1 } %add, 1
  br i1 %add.ovf, label %add.ovf_panic, label %add.ok

add.ok:                                           ; preds = %bb0
  ret i64 %add.val

add.ovf_panic:                                    ; preds = %bb0
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable
}

; Function Attrs: nocallback nofree nosync nounwind speculatable willreturn memory(none)
declare { i64, i1 } @llvm.ssub.with.overflow.i64(i64, i64) #2

; Function Attrs: cold noreturn
declare void @ori_panic_cstr(ptr) #3

; Function Attrs: nocallback nofree nosync nounwind speculatable willreturn memory(none)
declare { i64, i1 } @llvm.sadd.with.overflow.i64(i64, i64) #2

; Function Attrs: uwtable
define noundef i32 @main() #0 {
entry:
  %ori_main_result = call i64 @_ori_main()
  %exit_code = trunc i64 %ori_main_result to i32
  %leak_check = call i32 @ori_check_leaks()
  %has_leak = icmp ne i32 %leak_check, 0
  %final_exit = select i1 %has_leak, i32 %leak_check, i32 %exit_code
  ret i32 %final_exit
}

; Function Attrs: nounwind
declare i32 @ori_check_leaks() #4

attributes #0 = { uwtable }
attributes #1 = { nounwind memory(none) uwtable }
attributes #2 = { nocallback nofree nosync nounwind speculatable willreturn memory(none) }
attributes #3 = { cold noreturn }
attributes #4 = { nounwind }

Disassembly

_ori_fib:
   sub    $0x28,%rsp           ; frame setup (40 bytes for spills)
   mov    %rdi,0x18(%rsp)      ; save n
   cmp    $0x1,%rdi            ; n <= 1?
   mov    %rdi,0x20(%rsp)
   jle    .base                ; jump to base case
   mov    0x18(%rsp),%rax
   dec    %rax                 ; n - 1
   mov    %rax,0x10(%rsp)
   seto   %al
   jo     .panic_sub1
   jmp    .sub1_ok
.base:
   mov    0x20(%rsp),%rax      ; return n
   add    $0x28,%rsp
   ret
.sub1_ok:
   mov    0x10(%rsp),%rdi
   call   _ori_fib             ; fib(n-1)
   mov    %rax,%rcx
   mov    0x18(%rsp),%rax
   mov    %rcx,(%rsp)
   sub    $0x2,%rax            ; n - 2
   mov    %rax,0x8(%rsp)
   seto   %al
   jo     .panic_sub2
   jmp    .sub2_ok
.panic_sub1:
   lea    ovf.msg(%rip),%rdi
   call   ori_panic_cstr
.sub2_ok:
   mov    0x8(%rsp),%rdi
   call   _ori_fib             ; fib(n-2)
   mov    %rax,%rcx
   mov    (%rsp),%rax
   add    %rcx,%rax            ; fib(n-1) + fib(n-2)
   seto   %cl
   test   $0x1,%cl
   mov    %rax,0x20(%rsp)
   jne    .panic_add
   jmp    .base                ; merge via base return path
.panic_sub2:
   lea    ovf.msg(%rip),%rdi
   call   ori_panic_cstr
.panic_add:
   lea    ovf.msg.1(%rip),%rdi
   call   ori_panic_cstr

_ori_gcd:
   mov    %rdi,-0x10(%rsp)     ; store a (red zone -- no frame needed)
   mov    %rsi,-0x8(%rsp)      ; store b
   jmp    .loop                ; entry jump to loop header
.loop:
   mov    -0x10(%rsp),%rax
   mov    -0x8(%rsp),%rdx
   mov    %rdx,-0x20(%rsp)
   mov    %rax,-0x18(%rsp)
   cmp    $0x0,%rdx            ; b == 0?
   jne    .body
   mov    -0x18(%rsp),%rax     ; return a
   ret
.body:
   mov    -0x20(%rsp),%rcx
   mov    -0x18(%rsp),%rax
   cqto                        ; sign-extend for idiv
   idiv   %rcx                 ; a / b (remainder in %rdx)
   mov    -0x20(%rsp),%rax
   mov    %rax,-0x10(%rsp)
   mov    %rdx,-0x8(%rsp)
   jmp    .loop

_ori_main:
   sub    $0x18,%rsp
   mov    $0xa,%edi            ; fib(10)
   call   _ori_fib
   mov    %rax,0x8(%rsp)       ; save result
   mov    $0x30,%edi           ; gcd(48, 18)
   mov    $0x12,%esi
   call   _ori_gcd
   mov    %rax,%rcx
   mov    0x8(%rsp),%rax
   add    %rcx,%rax            ; f + g
   mov    %rax,0x10(%rsp)
   seto   %al
   jo     .panic
   mov    0x10(%rsp),%rax
   add    $0x18,%rsp
   ret
.panic:
   lea    ovf.msg.1(%rip),%rdi
   call   ori_panic_cstr

main:
   push   %rax
   call   _ori_main
   mov    %eax,0x4(%rsp)       ; save ori result
   call   ori_check_leaks      ; RC leak detection
   mov    %eax,%ecx
   mov    0x4(%rsp),%eax
   cmp    $0x0,%ecx
   cmovne %ecx,%eax            ; leak detected -> override exit code
   pop    %rcx
   ret

Deep Scrutiny

1. Instruction Purity

#	Function	Actual	Ideal	Ratio	Verdict
1	@fib	24	24	1.00x	OPTIMAL
2	@gcd	8	8	1.00x	OPTIMAL
3	@main	9	9	1.00x	OPTIMAL

@fib (24 instructions): Tree-recursive function with two subtraction overflow checks, one addition overflow check, two recursive calls, and a phi-based merge. All instructions justified. The phi node in bb3 accepts values directly from %bb0 (base case) and %sub.ok4 (recursive case) without empty intermediate blocks.

@gcd (8 instructions): The tail recursion is correctly lowered to a loop with phi nodes. The entry block bb3 contains br label %bb0 which is structurally required as a phi predecessor — the phi nodes in bb0 reference %bb3 for initial values. This is the minimal form for the tail-recursion-to-loop transformation. [NOTE-1]

@main (9 instructions): Two calls, one overflow-checked addition, branch to ok/panic. All instructions justified.

2. ARC Purity

Function	Balanced	Borrow Elision	Move Semantics
@fib	YES	N/A	N/A
@gcd	YES	N/A	N/A
@main	YES	N/A	N/A

Verdict: No heap values. Zero RC operations. OPTIMAL. All parameters and return values are scalar i64.

3. Attributes & Calling Convention

Function	fastcc	nounwind	memory(none)	noalias	noundef	uwtable	Notes
@fib	YES	NO	NO	N/A	YES	YES	nounwind absent due to panic paths (correct)
@gcd	YES	YES	YES	N/A	YES	YES	Pure function, correctly annotated [NOTE-2]
@main	NO (C)	NO	NO	N/A	YES	YES	C calling convention for entry point (correct)
@panic	N/A	N/A	N/A	N/A	N/A	N/A	cold noreturn (correct)
main	NO (C)	NO	NO	N/A	YES	YES	C entry wrapper (correct)

Attribute compliance: 15/15 applicable attributes present (100.0%).

The nounwind absence on @fib and @main is semantically correct — @fib can panic on overflow (calling ori_panic_cstr which is noreturn but may unwind), and @main transitively calls @fib. The nounwind fixed-point analysis correctly identifies only @gcd as provably non-unwinding.

The memory(none) on @gcd correctly identifies it as a pure function — it operates exclusively on scalar i64 values with comparison and signed remainder, reading and writing no memory. This enables LLVM to apply CSE, LICM, and dead call elimination.

The @main entry point correctly uses C calling convention (not fastcc) for ABI compatibility with the C main() wrapper. The noundef attribute is correctly present on all user function return values and parameters.

4. Control Flow & Block Layout

Function	Blocks	Phi Nodes	Notes
@fib	8	1	[NOTE-3]
@gcd	4	2	[NOTE-1]
@main	3	0

@fib: 8 blocks with zero empty blocks. The phi node in bb3 directly receives values from %bb0 (base case) and %sub.ok4 (recursive case) — no intermediate trampoline blocks.

@gcd: 4 blocks. The entry block bb3 contains br label %bb0 which is structurally necessary: the phi nodes in bb0 require it as a predecessor to accept initial parameter values (%0 from %bb3 vs %v13/%rem from %bb2). This is the canonical form for loop-lowered tail recursion.

@main: 3 blocks — clean structure with entry, ok, and panic blocks. No defects.

Total control flow defects: 0.

5. Overflow Checking

Status: PASS

Operation	Checked	Correct	Notes
sub (n-1)	YES	YES	`llvm.ssub.with.overflow.i64`
sub (n-2)	YES	YES	`llvm.ssub.with.overflow.i64`
add (fib+fib)	YES	YES	`llvm.sadd.with.overflow.i64`
rem (a%b)	N/A	N/A	`srem` — no overflow possible for integer remainder
add (f+g)	YES	YES	`llvm.sadd.with.overflow.i64`

All arithmetic operations that can overflow are correctly checked. The srem in @gcd correctly does NOT use overflow checking (integer remainder cannot overflow). Panic messages correctly distinguish “subtraction” vs “addition” overflow.

6. Binary Analysis

Metric	Value
Binary size	6.25 MiB (debug)
.text section	869.8 KiB
.rodata section	133.5 KiB
User code (@fib)	160 bytes (34 native instructions)
User code (@gcd)	80 bytes (20 native instructions)
User code (@main)	80 bytes (18 native instructions)
User code (main wrapper)	29 bytes (10 native instructions)
Total user code	349 bytes
Runtime	>99% of binary

Disassembly: @fib

_ori_fib:
   sub    $0x28,%rsp           ; frame setup (40 bytes for spills)
   mov    %rdi,0x18(%rsp)      ; save n
   cmp    $0x1,%rdi            ; n <= 1?
   jle    .base                ; direct branch to base/merge block
   ; ... overflow-checked subtraction, two recursive calls,
   ;     overflow-checked addition (34 native instructions total)
.base:
   mov    0x20(%rsp),%rax      ; return merged value
   add    $0x28,%rsp
   ret

Disassembly: @gcd

_ori_gcd:
   mov    %rdi,-0x10(%rsp)     ; store a (red zone -- no frame needed)
   mov    %rsi,-0x8(%rsp)      ; store b
   jmp    .loop                ; entry jump to loop header
.loop:
   cmp    $0x0,%rdx            ; b == 0?
   jne    .body
   ret                         ; return a
.body:
   cqto                        ; sign-extend for idiv
   idiv   %rcx                 ; a / b (remainder in %rdx)
   jmp    .loop

Disassembly: @main

_ori_main:
   sub    $0x18,%rsp
   mov    $0xa,%edi            ; fib(10)
   call   _ori_fib
   mov    %rax,0x8(%rsp)       ; save result
   mov    $0x30,%edi           ; gcd(48, 18)
   mov    $0x12,%esi
   call   _ori_gcd
   add    %rcx,%rax            ; f + g
   ; overflow check + return

7. Optimal IR Comparison

@fib: Ideal vs Actual

; IDEAL (24 instructions -- matches actual)
define fastcc noundef i64 @_ori_fib(i64 noundef %n) uwtable {
entry:
  %le = icmp sle i64 %n, 1
  br i1 %le, label %base, label %recurse

recurse:
  %sub1 = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %n, i64 1)
  %n1 = extractvalue { i64, i1 } %sub1, 0
  %ovf1 = extractvalue { i64, i1 } %sub1, 1
  br i1 %ovf1, label %panic_sub, label %ok1

base:                                             ; preds = %ok2, %entry
  %result = phi i64 [ %sum, %ok2 ], [ %n, %entry ]
  ret i64 %result

ok1:
  %r1 = call fastcc i64 @_ori_fib(i64 %n1)
  %sub2 = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %n, i64 2)
  %n2 = extractvalue { i64, i1 } %sub2, 0
  %ovf2 = extractvalue { i64, i1 } %sub2, 1
  br i1 %ovf2, label %panic_sub2, label %ok2

ok2:
  %r2 = call fastcc i64 @_ori_fib(i64 %n2)
  %add = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %r1, i64 %r2)
  %sum = extractvalue { i64, i1 } %add, 0
  %ovf3 = extractvalue { i64, i1 } %add, 1
  br i1 %ovf3, label %panic_add, label %base

panic_sub:
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable
panic_sub2:
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable
panic_add:
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable
}

Delta: +0 unjustified instructions. The actual IR matches the ideal structure — base case and recursive case merge directly into the phi node without empty intermediate blocks.

@gcd: Ideal vs Actual

; IDEAL (8 instructions -- entry block + loop with phi nodes)
define fastcc noundef i64 @_ori_gcd(i64 noundef %a, i64 noundef %b) nounwind memory(none) uwtable {
entry:
  br label %loop

loop:
  %va = phi i64 [ %a, %entry ], [ %vb, %body ]
  %vb = phi i64 [ %b, %entry ], [ %rem, %body ]
  %eq = icmp eq i64 %vb, 0
  br i1 %eq, label %done, label %body

done:
  ret i64 %va

body:
  %rem = srem i64 %va, %vb
  br label %loop
}

; ACTUAL (8 instructions -- matches ideal)
define fastcc noundef i64 @_ori_gcd(i64 noundef %0, i64 noundef %1) #1 {
bb3:
  br label %bb0

bb0:
  %v12 = phi i64 [ %0, %bb3 ], [ %v13, %bb2 ]
  %v13 = phi i64 [ %1, %bb3 ], [ %rem, %bb2 ]
  %eq = icmp eq i64 %v13, 0
  br i1 %eq, label %bb1, label %bb2

bb1:
  ret i64 %v12

bb2:
  %rem = srem i64 %v12, %v13
  br label %bb0
}

Delta: +0 unjustified instructions. The entry block bb3 with br label %bb0 is structurally required as a phi predecessor for the initial parameter values. The actual and ideal have identical structure.

@main: Ideal vs Actual

; IDEAL (9 instructions)
define noundef i64 @_ori_main() uwtable {
entry:
  %f = call fastcc i64 @_ori_fib(i64 10)
  %g = call fastcc i64 @_ori_gcd(i64 48, i64 18)
  %add = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %f, i64 %g)
  %sum = extractvalue { i64, i1 } %add, 0
  %ovf = extractvalue { i64, i1 } %add, 1
  br i1 %ovf, label %panic, label %ok

ok:
  ret i64 %sum

panic:
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable
}

Delta: +0 unjustified instructions. Actual IR matches ideal exactly.

Module Summary

Function	Ideal	Actual	Delta	Justified	Verdict
@fib	24	24	+0	N/A	OPTIMAL
@gcd	8	8	+0	N/A	OPTIMAL
@main	9	9	+0	N/A	OPTIMAL

8. Recursion: Tail-Call Optimization

The compiler correctly identifies @gcd as tail-recursive and lowers it to an iterative loop. This is verified by the presence of phi nodes at the loop header and br label %bb0 forming the back-edge — a proper loop structure with no recursive call @_ori_gcd instruction.

In contrast, @fib is tree-recursive (two recursive calls whose results are combined with +), which cannot be converted to a simple loop. The compiler correctly preserves both recursive call fastcc i64 @_ori_fib(...) instructions.

Tail-call detection quality: Excellent. The nounwind analysis correctly determined only @gcd is nounwind (because it has no panic paths, being purely comparison and remainder operations). @fib is correctly identified as potentially unwinding due to overflow-checked arithmetic that can call ori_panic_cstr.

The memory(none) attribute on @gcd correctly identifies it as a pure function with no memory side effects — it operates exclusively on register-level scalar values (i64), performs only comparison and signed remainder, and returns a scalar. This enables LLVM to apply CSE, LICM, and dead call elimination on repeated gcd invocations.

The loop-lowered @gcd uses phi nodes cleanly:

%v12 = phi i64 [ %0, %bb3 ], [ %v13, %bb2 ] — a becomes the previous b
%v13 = phi i64 [ %1, %bb3 ], [ %rem, %bb2 ] — b becomes a % b

This is textbook Euclidean algorithm loop form.

9. Recursion: Stack Frame Efficiency

For tree-recursive @fib, each invocation creates a stack frame. The native disassembly shows sub $0x28, %rsp — a 40-byte frame. At ~10 levels deep for fib(10), the stack usage is modest (~400 bytes peak). For larger inputs this could become significant, but that is inherent to tree recursion, not a compiler inefficiency.

For @gcd, the loop-lowered form uses red zone storage (-0x10(%rsp) through -0x20(%rsp)) without needing a frame setup (sub $X, %rsp). This is excellent — the nounwind attribute allows the compiler to use the red zone safely, and the loop structure means constant O(1) stack usage regardless of recursion depth.

10. Recursion: Leak Detection Integration

The main wrapper now includes RC leak detection via ori_check_leaks(). For this pure-scalar program, the leak checker correctly finds zero leaks. The integration adds 6 instructions to the wrapper (call, compare, select pattern) but does not affect user function codegen quality. The cmovne instruction elegantly overrides the exit code only when leaks are detected, avoiding a branch.

Findings

#	Severity	Category	Description	Status	First Seen
1	NOTE	Instruction Purity	@gcd entry block structurally required for phi predecessors	CONFIRMED	J3
2	NOTE	Attributes	memory(none) on @gcd — pure function correctly annotated	CONFIRMED	J3
3	NOTE	Attributes	100% attribute compliance (15/15)	CONFIRMED	J3
4	NOTE	Recursion	Excellent tail-call optimization on @gcd	CONFIRMED	J3
5	NOTE	Control Flow	Zero empty blocks in @fib (CFG simplification working)	CONFIRMED	J3
6	NOTE	Recursion	Correct nounwind analysis (gcd only)	CONFIRMED	J3
7	NOTE	ARC	Zero RC operations on pure scalar recursion	CONFIRMED	J3
8	NOTE	Binary	Leak detection integrated in main wrapper	NEW	J3

NOTE-1: @gcd entry block structurally required

Location: @gcd, block bb3 Impact: Positive — the entry block bb3 with br label %bb0 is the minimal form for tail-recursion-to-loop lowering. The phi nodes in bb0 require a distinct entry predecessor (%bb3) separate from the back-edge predecessor (%bb2) to accept initial parameter values. This is LLVM’s canonical loop form. Found in: Instruction Purity (Category 1), Control Flow (Category 4)

NOTE-2: memory(none) on @gcd

Location: @gcd function declaration Impact: Positive — the posthoc memory analysis correctly identifies @gcd as a pure function with no memory effects. This enables LLVM to apply CSE, LICM, and dead call elimination on repeated calls. Found in: Attributes & Calling Convention (Category 3)

NOTE-3: Zero empty blocks in @fib

Location: @fib CFG Impact: Positive — the phi node in bb3 receives values directly from %bb0 (base case) and %sub.ok4 (recursive case). No empty trampoline blocks remain. The CFG simplification pass is working correctly. Found in: Control Flow & Block Layout (Category 4)

NOTE-4: 100% attribute compliance

Location: All function declarations Impact: Positive — all 15 applicable attributes are correct. noundef on all parameters and return values, fastcc on internal functions, C convention on entry points, nounwind memory(none) on pure @gcd, cold noreturn on panic. Found in: Attributes & Calling Convention (Category 3)

NOTE-5: Excellent tail-call optimization

Location: @gcd function Impact: Positive — tail recursion correctly lowered to an iterative loop with phi nodes, eliminating all recursive call overhead. The resulting loop is optimal. Found in: Recursion: Tail-Call Optimization (Category 8)

NOTE-6: Correct nounwind analysis

Location: Fixed-point nounwind analysis Impact: Positive — the compiler correctly identifies that @gcd (with only comparison and remainder) cannot unwind, while @fib (with overflow-checked arithmetic leading to ori_panic_cstr) can. This enables red zone usage in @gcd’s native code. Found in: Recursion: Tail-Call Optimization (Category 8)

NOTE-7: Zero RC operations on pure scalar recursion

Location: All three functions Impact: Positive — the compiler correctly identifies that all values are scalar i64 and emits zero reference counting operations. ARC is completely absent from this program. Found in: ARC Purity (Category 2)

NOTE-8: Leak detection integrated in main wrapper

Location: main wrapper function Impact: Positive — the main() C entry wrapper now calls ori_check_leaks() after _ori_main() returns, using a cmovne to conditionally override the exit code when leaks are detected. For this pure-scalar program, the leak checker correctly finds nothing. This integration adds safety without affecting user function codegen. Found in: Recursion: Leak Detection Integration (Category 10)

Codegen Quality Score

Category	Weight	Score	Notes
Instruction Efficiency	15%	10/10	1.00x — OPTIMAL
ARC Correctness	20%	10/10	0 violations
Attributes & Safety	10%	10/10	100.0% compliance
Control Flow	10%	10/10	0 defects
IR Quality	20%	10/10	0 unjustified instructions
Binary Quality	10%	10/10	0 defects
Other Findings	15%	10/10	No uncategorized findings

Overall: 10.0 / 10

Verdict

Journey 3’s recursion codegen achieves a perfect 10.0/10. All three functions produce OPTIMAL IR with zero unjustified instructions. The highlights are the tail-call optimization on @gcd (lowered to a textbook loop with phi nodes and annotated with nounwind memory(none)) and the zero-defect CFG in @fib (no empty trampoline blocks). The leak detection integration in the main wrapper adds safety infrastructure without impacting user function quality.

Cross-Journey Observations

Feature	First Tested	This Journey	Status
Overflow checking	J1	J3	CONFIRMED
fastcc usage	J1	J3	CONFIRMED
Zero-RC scalar programs	J1	J3	CONFIRMED
100% attribute compliance	J1	J3	CONFIRMED
Empty blocks from if/else	J2	J3	FIXED (CFG simplification)
Tail-call to loop lowering	—	J3	CONFIRMED
nounwind fixed-point analysis	—	J3	CONFIRMED
memory(none) pure function	—	J3	CONFIRMED
Leak detection in wrapper	—	J3	NEW

I am recursive

What you'll learn

Score Breakdown