Journey 1: “I am arithmetic”

Source

// Journey 1: "I am arithmetic"
// Slug: arithmetic
// Difficulty: simple
// Features: arithmetic, function_calls, let_bindings, int_literals, multiple_functions
// Expected: (3 + 4) * 5 - 2 = 33

@add (a: int, b: int) -> int = a + b;

@main () -> int = {
    let x = 3;
    let y = 4;
    let sum = add(a: x, b: y);   // = 7
    let result = sum * 5 - 2;  // = 35 - 2 = 33
    result
}

Execution Results

Backend	Exit Code	Expected	Stdout	Stderr	Status
Eval	33	33	(none)	(none)	PASS
AOT	33	33	(none)	(none)	PASS

Compiler Pipeline

1. Lexer

The lexer (tokenizer) breaks raw source text into a stream of tokens — the smallest meaningful units like keywords, identifiers, operators, and literals. This is the first stage of every compiler.

Tokens: 77 | Keywords: 4 | Identifiers: 11 | Errors: 0

The source file is 387 bytes. The lexer produces 77 tokens with zero errors. Keywords include let (x4) and @ function markers (x2). Identifiers cover function names, parameter names, type names, and local bindings.

Token stream

Fn(@) Ident(add) LParen Ident(a) Colon Ident(int) Comma
Ident(b) Colon Ident(int) RParen Arrow Ident(int) Eq
Ident(a) Plus Ident(b) Semi

Fn(@) Ident(main) LParen RParen Arrow Ident(int) Eq LBrace
Let Ident(x) Eq Int(3) Semi
Let Ident(y) Eq Int(4) Semi
Let Ident(sum) Eq Ident(add) LParen Ident(a) Colon Ident(x)
  Comma Ident(b) Colon Ident(y) RParen Semi
Let Ident(result) Eq Ident(sum) Star Int(5) Minus Int(2) Semi
Ident(result) RBrace

2. Parser

The parser transforms the flat token stream into a hierarchical Abstract Syntax Tree (AST) — a tree structure that represents the grammatical structure of the program. Operator precedence is resolved here: * binds tighter than -.

Nodes: 16 | Max depth: 4 | Functions: 2 | Errors: 0

The parser produces 16 expression nodes across 2 function declarations. The sum * 5 - 2 expression is correctly parsed with * binding tighter than -, producing (sum * 5) - 2 not sum * (5 - 2). Named arguments a: and b: are preserved in the AST for later resolution to positional order.

AST (simplified)

Module
├─ FnDecl @add
│  ├─ Params: (a: int, b: int)
│  ├─ Return: int
│  └─ Body: BinOp(+)
│       ├─ Ident(a)
│       └─ Ident(b)
└─ FnDecl @main
   ├─ Return: int
   └─ Body: Block
        ├─ Let x = Lit(3)
        ├─ Let y = Lit(4)
        ├─ Let sum = Call(@add)
        │    ├─ a: Ident(x)
        │    └─ b: Ident(y)
        ├─ Let result = BinOp(-)
        │    ├─ BinOp(*)
        │    │    ├─ Ident(sum)
        │    │    └─ Lit(5)
        │    └─ Lit(2)
        └─ Ident(result)

3. Type Checker

The type checker verifies that all expressions have compatible types using Hindley-Milner type inference. It resolves type variables, checks constraints, and ensures type safety without requiring explicit annotations everywhere.

Constraints: 12 | Types inferred: 6 | Unifications: 10 | Errors: 0

All types resolve to int. The 6 inferred bindings are: x, y, sum, result, plus the two operator results (sum * 5 and (sum * 5) - 2). The type checker confirms that Add<int, int> -> int, Mul<int, int> -> int, and Sub<int, int> -> int are all valid. The return type of @main matches the final expression result: int.

Inferred types

@add (a: int, b: int) -> int = a + b
//                               ^ int (Add<int, int> -> int)

@main () -> int = {
    let x: int = 3           // inferred from literal
    let y: int = 4           // inferred from literal
    let sum: int = add(a: x, b: y)  // inferred from @add return type
    let result: int = sum * 5 - 2   // int (Mul then Sub)
    result  // -> int (matches return type)
}

4. Canonicalization

The canonicalizer transforms the typed AST into a simplified canonical form — a flat sequence of operations suitable for backend consumption. It desugars syntactic sugar, lowers complex expressions, and resolves named arguments to positional order.

Transforms: 2 | Desugared: 0 | Errors: 0

The canonicalizer produces 20 canon nodes from 16 AST nodes (let bindings create extra pattern nodes). Named arguments a: x, b: y are resolved to positional order. No desugaring is needed — there is no syntactic sugar in this program (no compound assignment, no pipe operators, no spread syntax).

Key transformations

- 20 canon nodes from 16 AST nodes (let bindings create extra pattern nodes)
- 2 roots: @add, @main
- 6 constants: int literals 3, 4, 5, 2 plus function-level metadata
- 0 decision trees (no pattern matching)
- Named arguments (a: x, b: y) resolved to positional order

5. ARC Pipeline

The ARC (Automatic Reference Counting) pipeline analyzes value lifetimes and inserts reference counting operations. It performs borrow inference to minimize RC overhead — parameters that are only read can be borrowed rather than owned.

RC ops inserted: 0 | Elided: 0 | Net ops: 0

This program uses only int scalars (i64), which are value types stored directly in registers. No heap allocation occurs, so no reference counting is needed. This is the optimal outcome for a scalar-only program.

ARC annotations

@add: no heap values -- pure scalar arithmetic (int params, int return)
@main: no heap values -- all let bindings hold int scalars
Total RC ops: 0 (optimal for scalar-only program)

Backend: Interpreter

The interpreter (eval path) executes the canonical IR directly, without compilation. It serves as the reference implementation for correctness testing — if eval and AOT disagree, the bug is in LLVM codegen, not the interpreter.

Result: 33 | Status: PASS

The eval trace shows the canonical execution order: @main evaluates the block, binds x=3, y=4, calls @add(3,4)=7, computes 7*5=35, then 35-2=33. The final expression result evaluates to Int(33).

Evaluation trace

@main()
  ├─ let x = 3
  ├─ let y = 4
  ├─ let sum = @add(a: 3, b: 4)
  │    └─ 3 + 4 = 7
  ├─ let result = 7 * 5 - 2
  │    ├─ 7 * 5 = 35
  │    └─ 35 - 2 = 33
  └─ result = 33
-> 33

Backend: LLVM Codegen

The LLVM backend compiles the canonical IR to LLVM IR, which is then compiled to native machine code via LLVM’s optimization and code generation pipeline. This path produces ahead-of-time compiled binaries.

ARC Pipeline

RC ops inserted: 0 | Elided: 0 | Net ops: 0

ARC annotations

@_ori_add: +0 rc_inc, +0 rc_dec (pure scalar -- no heap values)
@_ori_main: +0 rc_inc, +0 rc_dec (pure scalar -- no heap values)
Nounwind analysis: 2 passes (fixed-point), both functions marked nounwind

Generated LLVM IR

; ModuleID = '01-arithmetic'
source_filename = "01-arithmetic"

@ovf.msg = private unnamed_addr constant [29 x i8] c"integer overflow on addition\00", align 1
@ovf.msg.1 = private unnamed_addr constant [35 x i8] c"integer overflow on multiplication\00", align 1
@ovf.msg.2 = private unnamed_addr constant [32 x i8] c"integer overflow on subtraction\00", align 1

; Function Attrs: nounwind uwtable
; --- @add ---
define fastcc i64 @_ori_add(i64 %0, i64 %1) #0 {
bb0:
  %add = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %0, i64 %1)
  %add.val = extractvalue { i64, i1 } %add, 0
  %add.ovf = extractvalue { i64, i1 } %add, 1
  br i1 %add.ovf, label %add.ovf_panic, label %add.ok

add.ok:                                           ; preds = %bb0
  ret i64 %add.val

add.ovf_panic:                                    ; preds = %bb0
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable
}

; Function Attrs: nounwind uwtable
; --- @main ---
define i64 @_ori_main() #0 {
bb0:
  %call = call fastcc i64 @_ori_add(i64 3, i64 4)
  %mul = call { i64, i1 } @llvm.smul.with.overflow.i64(i64 %call, i64 5)
  %mul.val = extractvalue { i64, i1 } %mul, 0
  %mul.ovf = extractvalue { i64, i1 } %mul, 1
  br i1 %mul.ovf, label %mul.ovf_panic, label %mul.ok

mul.ok:                                           ; preds = %bb0
  %sub = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %mul.val, i64 2)
  %sub.val = extractvalue { i64, i1 } %sub, 0
  %sub.ovf = extractvalue { i64, i1 } %sub, 1
  br i1 %sub.ovf, label %sub.ovf_panic, label %sub.ok

mul.ovf_panic:                                    ; preds = %bb0
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable

sub.ok:                                           ; preds = %mul.ok
  ret i64 %sub.val

sub.ovf_panic:                                    ; preds = %mul.ok
  call void @ori_panic_cstr(ptr @ovf.msg.2)
  unreachable
}

; Function Attrs: nocallback nofree nosync nounwind speculatable willreturn memory(none)
declare { i64, i1 } @llvm.sadd.with.overflow.i64(i64, i64) #1

; Function Attrs: cold
declare void @ori_panic_cstr(ptr) #2

; Function Attrs: nocallback nofree nosync nounwind speculatable willreturn memory(none)
declare { i64, i1 } @llvm.smul.with.overflow.i64(i64, i64) #1

; Function Attrs: nocallback nofree nosync nounwind speculatable willreturn memory(none)
declare { i64, i1 } @llvm.ssub.with.overflow.i64(i64, i64) #1

define i32 @main() {
entry:
  %ori_main_result = call i64 @_ori_main()
  %exit_code = trunc i64 %ori_main_result to i32
  ret i32 %exit_code
}

attributes #0 = { nounwind uwtable }
attributes #1 = { nocallback nofree nosync nounwind speculatable willreturn memory(none) }
attributes #2 = { cold }

Disassembly

000000000001b100 <_ori_add>:                    ; 31 bytes
   1b100:  push   %rax                   ; save scratch register / align stack
   1b101:  add    %rsi,%rdi              ; a + b (sets overflow flag)
   1b104:  mov    %rdi,(%rsp)            ; spill result to stack (O0 regalloc)
   1b108:  seto   %al                    ; set AL=1 if overflow occurred
   1b10b:  jo     panic                  ; branch to panic if overflow
   1b10d:  mov    (%rsp),%rax            ; reload result from stack (O0 regalloc)
   1b111:  pop    %rcx                   ; restore stack
   1b112:  ret                           ; return result in %rax
   ; --- overflow path ---
   1b113:  lea    ovf.msg(%rip),%rdi     ; load panic message address
   1b11a:  call   ori_panic_cstr         ; panic (does not return)

000000000001b120 <_ori_main>:                   ; 93 bytes
   1b120:  sub    $0x18,%rsp              ; stack frame (24 bytes)
   1b124:  mov    $0x3,%edi               ; arg a = 3
   1b129:  mov    $0x4,%esi               ; arg b = 4
   1b12e:  call   _ori_add               ; sum = add(3, 4) -> 7
   1b133:  mov    $0x5,%ecx              ; load constant 5
   1b138:  imul   %rcx,%rax              ; sum * 5 = 35
   1b13c:  mov    %rax,0x10(%rsp)        ; spill mul result (O0)
   1b141:  seto   %al                    ; overflow check (mul)
   1b144:  jo     mul_panic              ; branch if overflow
   1b146:  mov    0x10(%rsp),%rax        ; reload mul result (O0)
   1b14b:  sub    $0x2,%rax              ; 35 - 2 = 33
   1b14f:  mov    %rax,0x8(%rsp)         ; spill sub result (O0)
   1b154:  seto   %al                    ; overflow check (sub)
   1b157:  jo     sub_panic              ; branch if overflow
   1b159:  jmp    epilogue               ; jump over panic blocks
   ; --- mul overflow path ---
   1b15b:  lea    ovf.msg.1(%rip),%rdi
   1b162:  call   ori_panic_cstr
   ; --- epilogue ---
   1b167:  mov    0x8(%rsp),%rax         ; reload final result (O0)
   1b16c:  add    $0x18,%rsp             ; restore stack
   1b170:  ret                           ; return 33
   ; --- sub overflow path ---
   1b171:  lea    ovf.msg.2(%rip),%rdi
   1b178:  call   ori_panic_cstr

000000000001b180 <main>:                        ; 8 bytes
   1b180:  push   %rax
   1b181:  call   _ori_main
   1b186:  pop    %rcx
   1b187:  ret

Deep Scrutiny

1. Instruction Purity

#	Function	Actual (IR)	Ideal (IR)	Ratio	Verdict
1	@add	7	7	1.00x	OPTIMAL
2	@main	14	14	1.00x	OPTIMAL
3	main wrapper	3	3	1.00x	OPTIMAL

@add (7 instructions): Every instruction is justified. The overflow-checked addition requires the intrinsic call (1), two extractvalues to split the result and overflow flag (2), a conditional branch (1), a return (1), and a panic path with call + unreachable (2). No wasted instructions. OPTIMAL.

@main (14 instructions): Every instruction is justified. The function calls @_ori_add (1), performs overflow-checked multiplication (call + 2 extractvalue + branch = 4), overflow-checked subtraction (call + 2 extractvalue + branch = 4), returns (1), and has two panic paths (2 x (call + unreachable) = 4). The block merge pass has eliminated the previously-observed redundant br label %bb1 bridge block — the call result now flows directly into the multiply in the same bb0 block. OPTIMAL.

Let binding elimination: All four let bindings (x, y, sum, result) are correctly eliminated — no alloca/store/load chains. Constants 3 and 4 are inlined directly as call arguments. The call result feeds directly into the multiply intrinsic. This is -O1 quality codegen in a debug build.

Native instruction overhead: The native disassembly shows additional overhead from LLVM’s -O0 register allocation — stack spills (mov %rax,0x10(%rsp) / mov 0x10(%rsp),%rax) that would be eliminated at -O1. This is expected for unoptimized builds and not charged against IR quality.

2. ARC Purity

Function	rc_inc	rc_dec	Balanced	Borrow Elision	Move Semantics
@add	0	0	YES	N/A	N/A
@main	0	0	YES	N/A	N/A

Verdict: Zero RC operations. Correct — this program uses only int scalars (i64), which are value types requiring no reference counting. No ori_rc_inc, ori_rc_dec, or any RC-related calls present in the IR or disassembly. OPTIMAL.

3. Attributes & Calling Convention

Function	fastcc	nounwind	uwtable	noalias	readonly	cold	Notes
@add	YES	YES	YES	N/A	N/A	N/A
@main	NO (C)	YES	YES	N/A	N/A	N/A	C conv for entry point — correct
main wrapper	NO (C)	NO	NO	N/A	N/A	N/A
ori_panic_cstr	N/A	N/A	N/A	N/A	N/A	YES	Missing noreturn [LOW-1]

@_ori_add uses fastcc: Correct. Internal function benefits from fast calling convention (callee-pops, register preference). The nounwind attribute is present (fixed-point analysis confirms both user functions do not unwind). uwtable is present for stack unwinding support.

@_ori_main uses C convention: Correct. Called from the C main() wrapper, must use C ABI for compatibility with the system entry point. Also marked nounwind and uwtable.

ori_panic_cstr has cold but not noreturn: This function never returns (it calls longjmp or abort). Missing noreturn means LLVM cannot eliminate dead code after panic calls and may generate suboptimal branch layouts. However, the unreachable instruction after each call site already conveys non-return semantics to LLVM. The impact is therefore minimal for this program. The cold attribute is correct and helps branch prediction.

Attribute compliance: 7 applicable attributes checked (fastcc, nounwind, uwtable on @add; nounwind, uwtable on @main; noreturn on ori_panic_cstr; cold on ori_panic_cstr). 6 of 7 correct — only noreturn on ori_panic_cstr is missing. 85.7% compliance.

4. Control Flow & Block Layout

Function	Blocks	Notes
@add	3	Optimal layout
@main	5	Optimal — block merge pass eliminated bridge block [NOTE-2]
main wrapper	1	Optimal

@add block layout: 3 blocks — bb0 (entry with overflow check), add.ok (happy-path return), add.ovf_panic (cold panic). Happy path is fallthrough from the conditional branch. Panic block placed at the end. Optimal layout.

@main block layout: 5 blocks with zero redundant branches. The block merge pass has eliminated the bb0 -> bb1 bridge block that was present in the previous run. The call to @_ori_add and the subsequent overflow-checked multiply now reside in the same bb0 block:

bb0: call to @_ori_add + overflow-checked multiply + conditional branch
mul.ok: overflow-checked subtract + conditional branch
mul.ovf_panic: cold panic for multiply overflow
sub.ok: return
sub.ovf_panic: cold panic for subtract overflow

Panic blocks are correctly placed after the happy path, and the cold attribute on ori_panic_cstr helps LLVM’s branch prediction heuristics. Optimal layout.

5. Overflow Checking

Status: PASS

Operation	Intrinsic	Checked	Correct	Panic Message
`a + b`	`llvm.sadd.with.overflow.i64`	YES	YES	”integer overflow on addition”
`sum * 5`	`llvm.smul.with.overflow.i64`	YES	YES	”integer overflow on multiplication”
`sum * 5 - 2`	`llvm.ssub.with.overflow.i64`	YES	YES	”integer overflow on subtraction”

All three arithmetic operations use the correct LLVM signed overflow intrinsics. Each operation has a dedicated human-readable panic message stored as a global constant string. The overflow flag is checked with a conditional branch (br i1 %ovf) routing to a cold-attributed panic function followed by unreachable. This is correct per the Ori spec: “overflow panics.”

The panic messages are operation-specific (not generic), which is good for debugging. The message strings are private unnamed_addr constant with NUL termination, which is correct for C string interop with ori_panic_cstr.

6. Binary Analysis

Metric	Value
Binary size	6.25 MiB (6,554,584 bytes, debug)
.text section	868 KiB (889,377 bytes)
.rodata section	134 KiB (136,778 bytes)
.debug_info	1.56 MiB (1,638,828 bytes)
.debug_str	1.72 MiB (1,803,885 bytes)
.eh_frame	109 KiB (111,976 bytes)
User code (@add)	31 bytes
User code (@main)	93 bytes
User code (main wrapper)	8 bytes
User code total	132 bytes
User code % of .text	0.015%
Runtime % of binary	~99.98%

The binary is large due to static linking of ori_rt (the Ori runtime, which includes Rust’s standard library for panic handling, I/O, memory allocation, etc.) and full debug symbols (3.28 MiB of .debug_* sections). The user’s actual code is 132 bytes — everything else is runtime infrastructure. This is expected for a debug build of a statically-linked binary.

Note that @main shrank from 101 bytes (previous run) to 93 bytes, a direct result of the block merge pass eliminating the bridge block and its corresponding native jmp instruction.

Disassembly: @add

_ori_add:                        ; 31 bytes, 10 instructions (+ 1 nop)
  push   %rax                   ; save scratch register / align stack
  add    %rsi,%rdi              ; a + b (sets overflow flag)
  mov    %rdi,(%rsp)            ; spill result to stack (O0 regalloc)
  seto   %al                    ; set AL=1 if overflow occurred
  jo     panic                  ; branch to panic if overflow
  mov    (%rsp),%rax            ; reload result from stack (O0 regalloc)
  pop    %rcx                   ; restore stack
  ret                           ; return result in %rax
  ; --- overflow path ---
  lea    ovf.msg(%rip),%rdi     ; load panic message address
  call   ori_panic_cstr         ; panic (does not return)

Disassembly: @main

_ori_main:                       ; 93 bytes, 22 instructions
  sub    $0x18,%rsp              ; stack frame (24 bytes)
  mov    $0x3,%edi               ; arg a = 3
  mov    $0x4,%esi               ; arg b = 4
  call   _ori_add               ; sum = add(3, 4) -> 7
  mov    $0x5,%ecx              ; load constant 5
  imul   %rcx,%rax              ; sum * 5 = 35
  mov    %rax,0x10(%rsp)        ; spill mul result (O0)
  seto   %al                    ; overflow check (mul)
  jo     mul_panic              ; branch if overflow
  mov    0x10(%rsp),%rax        ; reload mul result (O0)
  sub    $0x2,%rax              ; 35 - 2 = 33
  mov    %rax,0x8(%rsp)         ; spill sub result (O0)
  seto   %al                    ; overflow check (sub)
  jo     sub_panic              ; branch if overflow
  jmp    epilogue               ; jump over panic blocks
  ; --- mul overflow path ---
  lea    ovf.msg.1(%rip),%rdi
  call   ori_panic_cstr
  ; --- epilogue ---
  mov    0x8(%rsp),%rax         ; reload final result (O0)
  add    $0x18,%rsp             ; restore stack
  ret                           ; return 33
  ; --- sub overflow path ---
  lea    ovf.msg.2(%rip),%rdi
  call   ori_panic_cstr

7. Optimal IR Comparison

@add: Ideal vs Actual

; IDEAL (7 instructions -- overflow checking is mandatory)
define fastcc i64 @_ori_add(i64 %a, i64 %b) #0 {
entry:
  %r = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %a, i64 %b)
  %val = extractvalue { i64, i1 } %r, 0
  %ovf = extractvalue { i64, i1 } %r, 1
  br i1 %ovf, label %panic, label %ok
ok:
  ret i64 %val
panic:
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable
}

; ACTUAL (7 instructions)
define fastcc i64 @_ori_add(i64 %0, i64 %1) #0 {
bb0:
  %add = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 %0, i64 %1)
  %add.val = extractvalue { i64, i1 } %add, 0
  %add.ovf = extractvalue { i64, i1 } %add, 1
  br i1 %add.ovf, label %add.ovf_panic, label %add.ok
add.ok:
  ret i64 %add.val
add.ovf_panic:
  call void @ori_panic_cstr(ptr @ovf.msg)
  unreachable
}

Delta: 0 instructions. The actual IR matches the ideal IR exactly in structure and instruction count. Only differences are naming conventions (bb0/add.ok/add.ovf_panic vs entry/ok/panic) and unnamed parameters (%0, %1 vs %a, %b). OPTIMAL.

@main: Ideal vs Actual

; IDEAL (14 instructions)
define i64 @_ori_main() #0 {
entry:
  %sum = call fastcc i64 @_ori_add(i64 3, i64 4)
  %mul = call { i64, i1 } @llvm.smul.with.overflow.i64(i64 %sum, i64 5)
  %mul.v = extractvalue { i64, i1 } %mul, 0
  %mul.o = extractvalue { i64, i1 } %mul, 1
  br i1 %mul.o, label %mul_panic, label %mul_ok
mul_ok:
  %sub = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %mul.v, i64 2)
  %sub.v = extractvalue { i64, i1 } %sub, 0
  %sub.o = extractvalue { i64, i1 } %sub, 1
  br i1 %sub.o, label %sub_panic, label %sub_ok
sub_ok:
  ret i64 %sub.v
mul_panic:
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable
sub_panic:
  call void @ori_panic_cstr(ptr @ovf.msg.2)
  unreachable
}

; ACTUAL (14 instructions)
define i64 @_ori_main() #0 {
bb0:
  %call = call fastcc i64 @_ori_add(i64 3, i64 4)
  %mul = call { i64, i1 } @llvm.smul.with.overflow.i64(i64 %call, i64 5)
  %mul.val = extractvalue { i64, i1 } %mul, 0
  %mul.ovf = extractvalue { i64, i1 } %mul, 1
  br i1 %mul.ovf, label %mul.ovf_panic, label %mul.ok
mul.ok:
  %sub = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 %mul.val, i64 2)
  %sub.val = extractvalue { i64, i1 } %sub, 0
  %sub.ovf = extractvalue { i64, i1 } %sub, 1
  br i1 %sub.ovf, label %sub.ovf_panic, label %sub.ok
mul.ovf_panic:
  call void @ori_panic_cstr(ptr @ovf.msg.1)
  unreachable
sub.ok:
  ret i64 %sub.val
sub.ovf_panic:
  call void @ori_panic_cstr(ptr @ovf.msg.2)
  unreachable
}

Delta: 0 instructions. The actual IR matches the ideal IR exactly. The block merge pass has eliminated the previously-observed redundant br label %bb1 bridge block between the call and the multiply. The call to @_ori_add and the overflow-checked multiply now reside in the same bb0 block, exactly as in the ideal IR. OPTIMAL.

main wrapper: Ideal vs Actual

; IDEAL (3 instructions)
define i32 @main() {
entry:
  %r = call i64 @_ori_main()
  %c = trunc i64 %r to i32
  ret i32 %c
}

; ACTUAL (3 instructions)
define i32 @main() {
entry:
  %ori_main_result = call i64 @_ori_main()
  %exit_code = trunc i64 %ori_main_result to i32
  ret i32 %exit_code
}

Delta: 0 instructions. OPTIMAL.

Module Summary

Function	Ideal	Actual	Delta	Justified	Verdict
@add	7	7	+0	N/A	OPTIMAL
@main	14	14	+0	N/A	OPTIMAL
main wrapper	3	3	+0	N/A	OPTIMAL
Total	24	24	+0

8. Arithmetic: Let Binding Elimination

All four let bindings are compiled away entirely — no alloca, no store, no load. Values flow directly as SSA registers:

let x = 3 — constant i64 3 passed directly to @_ori_add as first argument
let y = 4 — constant i64 4 passed directly to @_ori_add as second argument
let sum = add(...) — %call result feeds directly into smul.with.overflow
let result = sum * 5 - 2 — %sub.val is the return value directly

This is excellent codegen — many compilers at -O0 would emit alloca+store+load for each binding. Ori’s codegen emits direct SSA, which is closer to -O1 quality even in debug builds. The IR treats let bindings as SSA value bindings rather than stack slots, which is the correct semantic for immutable scalar let bindings.

9. Arithmetic: Constant Propagation Opportunities

Expression	Foldable?	Folded?	Notes
`add(a: 3, b: 4)`	YES (interprocedural)	NO	Call emitted — acceptable, requires inlining
`7 * 5`	Depends on fold of add	NO	Would require folding add first
`35 - 2`	Depends on fold of mul	NO	Would require folding multiply first

The codegen does not perform interprocedural constant folding — @add is a separate function that might have side effects from overflow checking. LLVM’s -O1+ passes would inline @add and fold the entire main body to ret i64 33. Current -O0 behavior is correct — constant folding across function boundaries is an optimization, not a correctness requirement. At -O0, maintaining separate functions is better for debugging (breakpoints, stack traces).

Findings

#	Severity	Category	Description	Status	First Seen
1	LOW	Attributes	Missing `noreturn` on `ori_panic_cstr` declaration	CONFIRMED	J1
2	NOTE	Control Flow	Block merge pass eliminated redundant `bb0 -> bb1` bridge	FIXED	J1
3	NOTE	Instruction Purity	Let bindings eliminated to direct SSA — O1-quality at O0	CONFIRMED	J1
4	NOTE	Instruction Purity	Both user functions match ideal IR exactly — OPTIMAL	NEW	J1

LOW-1: Missing `noreturn` on `ori_panic_cstr` declaration

Location: declare void @ori_panic_cstr(ptr) #2 where #2 = { cold } Impact: Without noreturn, LLVM may not fully optimize code paths after panic calls. However, the unreachable instruction after each call site already conveys non-return semantics to LLVM at the call site level, minimizing the practical impact. The cold attribute correctly marks the function as unlikely. Fix: Add noreturn to the ori_panic_cstr declaration. The attribute group should be { cold noreturn }. This should be changed in compiler/ori_llvm/src/codegen/runtime_decl/mod.rs. First seen: Journey 1 Found in: Attributes & Calling Convention (Category 3)

NOTE-2: Block merge pass eliminated redundant bridge block

Location: _ori_main, previously had bb0: ... br label %bb1 / bb1: %mul = ... Impact: Positive. The newly-implemented block merge pass correctly identified and eliminated the unconditional branch between bb0 and bb1. The call to @_ori_add and the overflow-checked multiply now reside in the same block. This removed 1 instruction from @main (15 to 14) and 8 bytes from the native code (101 to 93 bytes). @main now matches the ideal IR exactly. Previous state: MEDIUM-1 in previous run (redundant unconditional branch, 6.7% overhead) Found in: Control Flow & Block Layout (Category 4)

NOTE-3: Let bindings eliminated to direct SSA

Location: _ori_main — all four let bindings compiled to SSA registers Impact: Positive. No alloca/store/load chains for scalar let bindings. Values flow directly from definition to use as SSA values. This is -O1 quality codegen in a debug build — many compilers (including Clang at -O0) would emit stack operations for every local variable. Found in: Arithmetic: Let Binding Elimination (Category 8)

NOTE-4: Both user functions match ideal IR exactly

Location: _ori_add and _ori_main Impact: Positive. Both user functions now produce instruction counts exactly matching the hand-written ideal IR. @add has always been optimal (7/7). @main improved from 15 to 14 instructions thanks to the block merge pass, now achieving 14/14 — a perfect score. The entire module has zero unjustified overhead. Found in: Optimal IR Comparison (Category 7)

Codegen Quality Score

Category	Weight	Score	Notes
Instruction Efficiency	15%	10/10	1.00x — OPTIMAL
ARC Correctness	20%	10/10	0 violations
Attributes & Safety	10%	7/10	85.7% compliance
Control Flow	10%	10/10	0 defects
IR Quality	20%	10/10	0 unjustified instructions
Binary Quality	10%	10/10	0 defects
Other Findings	15%	10/10	No uncategorized findings

Overall: 9.7 / 10

Verdict

Journey 1’s arithmetic codegen has reached near-perfection. Both @add and @main now match the hand-written ideal IR instruction-for-instruction — zero overhead beyond mandatory overflow checking. The block merge pass, implemented in this session, eliminated the previously-observed redundant bridge block in @main, improving the score from 9.2 to 9.7. The only remaining gap is the missing noreturn attribute on ori_panic_cstr, which has minimal practical impact since unreachable already terminates each call site.

Cross-Journey Observations

Feature	First Tested	This Journey	Status
Overflow checking	J1	J1	CONFIRMED (correct)
fastcc usage	J1	J1	CONFIRMED (correct)
nounwind attribute	J1	J1	CONFIRMED (now present — was missing in old-format J1)
Redundant bridge blocks	J1	J1	FIXED (block merge pass)
Missing noreturn on panic	J1	J1	CONFIRMED (still missing)

The block merge pass is a significant improvement to codegen quality. For this simple arithmetic journey, it eliminated the only unjustified overhead, bringing the instruction ratio to a perfect 1.00x. The impact on more complex journeys (with more let-binding boundaries and deeper block nesting) should be investigated in subsequent journey reruns.

I am arithmetic

What you'll learn

Score Breakdown