Add a debug option to generate a flamegraph for compiler performance #4075

carolynzech · 2025-05-14T18:56:47Z

Proposed change: For a given code generation pass, we'd get the call tree annotated with the time spent in each function. For instance, if codegen_items calls codegen_function, which calls codegen_statement n times, etc., we'd get the time we spent in each of those calls, down to the end of the call tree. I'm imagining we'd visualize it as a tree because the depth of the call stack is also important, but I'm flexible on the format; the main point is getting data on how long different parts of codegen take, not just codegen as a whole.
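To make the proposal concrete, here is a minimal sketch of a recorder that builds a call tree annotated with wall-clock time. Everything in it (`CallTree`, `TimedCall`, `time`, `render`, `demo`) is hypothetical and not part of Kani; only the function names `codegen_items`, `codegen_function`, and `codegen_statement` are taken from the description above, and they are used purely as labels.

```rust
use std::time::{Duration, Instant};

/// One node of the timing tree: a named call plus the calls made beneath it.
#[derive(Debug)]
pub struct TimedCall {
    pub name: String,
    pub elapsed: Duration,
    pub children: Vec<TimedCall>,
}

/// Hypothetical recorder (an assumption for illustration, not a Kani type).
#[derive(Default)]
pub struct CallTree {
    /// Calls that are still running, innermost last.
    open: Vec<(String, Instant, Vec<TimedCall>)>,
    /// Completed top-level calls.
    pub roots: Vec<TimedCall>,
}

impl CallTree {
    /// Run `f`, record how long it took, and attach the result to the
    /// enclosing call (or to `roots` if there is none).
    pub fn time<T>(&mut self, name: &str, f: impl FnOnce(&mut CallTree) -> T) -> T {
        self.open.push((name.to_string(), Instant::now(), Vec::new()));
        let result = f(&mut *self);
        let (name, start, children) = self.open.pop().expect("frame pushed above");
        let node = TimedCall { name, elapsed: start.elapsed(), children };
        match self.open.last_mut() {
            Some((_, _, siblings)) => siblings.push(node),
            None => self.roots.push(node),
        }
        result
    }

    /// Render the tree as indented text, one call per line.
    pub fn render(&self) -> String {
        fn walk(node: &TimedCall, depth: usize, out: &mut String) {
            out.push_str(&format!("{}{} {:?}\n", "  ".repeat(depth), node.name, node.elapsed));
            for child in &node.children {
                walk(child, depth + 1, out);
            }
        }
        let mut out = String::new();
        for root in &self.roots {
            walk(root, 0, &mut out);
        }
        out
    }
}

/// Mimics the call shape from the description; the closures do no real work.
pub fn demo() -> CallTree {
    let mut tree = CallTree::default();
    tree.time("codegen_items", |t| {
        t.time("codegen_function", |t| {
            for _ in 0..3 {
                t.time("codegen_statement", |_| std::hint::black_box(1 + 1));
            }
        });
    });
    tree
}
```

Calling `print!("{}", demo().render())` from a `main` function prints one indented line per call with its elapsed time. The recorder is passed explicitly to each closure to keep the sketch free of global state; a real implementation would more likely need a thread-local or a tracing layer so that existing function signatures do not change.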

Motivation: From some rudimentary benchmarking with with_timer:

kani/kani-compiler/src/codegen_cprover_gotoc/compiler_interface.rs, lines 831 to 833 in 2227614:

    /// Execute the provided function and measure the clock time it took for its execution.
    /// Log the time with the given description.
    pub fn with_timer<T, F>(func: F, description: &str) -> T

we've determined that most of Kani's compiler runtime comes from code generation. But we need more fine-grained data to know why code generation is slow. Currently, it's unclear whether:

Most functions are fast enough, but there are specific outliers we can blame, e.g., if generating code for a function with fat pointers or dynamic dispatch is much slower than for functions without them, or
Most functions are fast enough in isolation, but we have a deep call tree so their cumulative time is large, suggesting that we need a larger-scale refactoring to produce speedups, or at least small-scale optimizations of multiple functions.

Having this data would help us make that determination.

tautschnig · 2025-05-15T04:24:43Z

I previously used flamegraphs for such profiling, and the Rust profiling docs link to https://github.com/flamegraph-rs/flamegraph - would that perhaps be a good option?

carolynzech · 2025-05-15T17:49:04Z

> I previously used flamegraphs for such profiling, and the Rust profiling docs link to https://github.com/flamegraph-rs/flamegraph - would that perhaps be a good option?

Yep, that looks great!
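If the timings end up being collected inside the compiler rather than by an external sampling profiler, one possible route to a flamegraph is to write them out in the folded-stack text format (`frame;frame;frame value`, one stack per line) that renderers such as `flamegraph.pl` and `inferno-flamegraph` read. The sketch below assumes a hypothetical `Sample` record holding a stack path and the self time spent at that path; neither `Sample` nor `fold` exists in Kani, and the numbers in `demo` are invented for illustration.

```rust
use std::collections::BTreeMap;
use std::time::Duration;

/// Hypothetical record of one completed call: the full stack path, outermost
/// frame first, and the time spent in the innermost frame itself (excluding
/// its callees).
pub struct Sample {
    pub stack: Vec<&'static str>,
    pub self_time: Duration,
}

/// Merge samples that share a stack and emit folded-stack lines, using whole
/// microseconds of self time as the per-stack value.
pub fn fold(samples: &[Sample]) -> String {
    let mut totals: BTreeMap<String, u128> = BTreeMap::new();
    for sample in samples {
        *totals.entry(sample.stack.join(";")).or_insert(0) += sample.self_time.as_micros();
    }
    totals
        .iter()
        .map(|(stack, micros)| format!("{} {}\n", stack, micros))
        .collect()
}

/// Invented example data shaped like the call chain named in the issue.
pub fn demo() -> String {
    let statement = vec!["codegen_items", "codegen_function", "codegen_statement"];
    let samples = vec![
        Sample { stack: statement.clone(), self_time: Duration::from_micros(40) },
        Sample { stack: statement, self_time: Duration::from_micros(40) },
        Sample {
            stack: vec!["codegen_items", "codegen_function"],
            self_time: Duration::from_micros(10),
        },
    ];
    fold(&samples)
}
```

Self time is used rather than cumulative time because a flamegraph renderer sums the values of all stacks passing through a frame to get that frame's width, so emitting cumulative time would count callees twice.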

carolynzech added labels [E] Performance (track performance improvement: time / memory / CPU) and [C] Internal (tracks some internal work, i.e., users should not be affected) May 14, 2025

carolynzech changed the title ~~Add a debug option to measure fine-grained compiler performance~~ Add a debug option to generate a flamegraph for compiler performance May 15, 2025
