Arborist

Multi-language code complexity metrics powered by tree-sitter.

Arborist computes cognitive complexity (SonarSource), cyclomatic complexity (McCabe), and source lines of code (SLOC) for functions and methods across 12 programming languages -- all from a single, embeddable Rust library.

Supported Languages

Language	Feature flag	Extensions	Default
Rust	`rust`	`.rs`	Yes
Python	`python`	`.py`, `.pyi`	Yes
JavaScript	`javascript`	`.js`, `.jsx`, `.mjs`, `.cjs`	Yes
TypeScript	`typescript`	`.ts`, `.tsx`, `.mts`, `.cts`	Yes
Java	`java`	`.java`	Yes
Go	`go`	`.go`	Yes
C#	`csharp`	`.cs`	Opt-in
C++	`cpp`	`.cpp`, `.cc`, `.cxx`, `.hpp`, `.hxx`, `.hh`	Opt-in
C	`c`	`.c`, `.h`	Opt-in
PHP	`php`	`.php`	Opt-in
Kotlin	`kotlin`	`.kt`, `.kts`	Opt-in
Swift	`swift`	`.swift`	Opt-in

Feature Flags

Arborist uses Cargo feature flags to control which tree-sitter grammars are compiled. Each language is an independent, optional feature that pulls in its corresponding tree-sitter grammar crate.

To enable all 12 languages, use features = ["all"].

Tiers

Tier 1 (default): Rust, Python, JavaScript, TypeScript, Java, Go -- the most mature tree-sitter grammars. Enabled automatically unless you set default-features = false.
Tier 2 (opt-in): C#, C++, C, PHP, Kotlin, Swift -- require enabling their feature flag explicitly or using the all composite feature.

Individual features

Feature flag	Language	Tier	tree-sitter crate
`rust`	Rust	1	`tree-sitter-rust`
`python`	Python	1	`tree-sitter-python`
`javascript`	JavaScript	1	`tree-sitter-javascript`
`typescript`	TypeScript	1	`tree-sitter-typescript`
`java`	Java	1	`tree-sitter-java`
`go`	Go	1	`tree-sitter-go`
`csharp`	C#	2	`tree-sitter-c-sharp`
`cpp`	C++	2	`tree-sitter-cpp`
`c`	C	2	`tree-sitter-c`
`php`	PHP	2	`tree-sitter-php`
`kotlin`	Kotlin	2	`tree-sitter-kotlin-ng`
`swift`	Swift	2	`tree-sitter-swift`

Composite features

Feature	Expands to
`default`	`rust`, `python`, `javascript`, `typescript`, `java`, `go`
`all`	All 12 languages (`default` + `csharp`, `cpp`, `c`, `php`, `kotlin`, `swift`)

Compile-time note: Each grammar adds compile time and binary size. Use default-features = false with only the languages you need for minimal builds, or all when you need broad language coverage.

Installation

Add to your Cargo.toml:

# Default features (Tier 1): Rust, Python, JavaScript, TypeScript, Java, Go
[dependencies]
arborist-metrics = "0.1"

Select specific languages to reduce compile time:

[dependencies]
arborist-metrics = { version = "0.1", default-features = false, features = ["rust", "python"] }

Enable all 12 languages:

[dependencies]
arborist-metrics = { version = "0.1", features = ["all"] }

Quick Start

Analyze a file

use arborist::{analyze_file, FileReport};

fn main() -> Result<(), arborist::ArboristError> {
    let report: FileReport = analyze_file("src/main.rs")?;

    println!("File: {} ({:?})", report.path, report.language);
    println!("Total cognitive: {}, SLOC: {}", report.file_cognitive, report.file_sloc);

    for func in &report.functions {
        println!("  {} (lines {}-{}): cognitive={}, cyclomatic={}, sloc={}",
            func.name, func.start_line, func.end_line,
            func.cognitive, func.cyclomatic, func.sloc);
    }

    Ok(())
}

Analyze source code from memory

use arborist::{analyze_source, Language};

let source = r#"
def hello(name):
    if name:
        print(f"Hello, {name}!")
    else:
        print("Hello, world!")
"#;

let report = analyze_source(source, Language::Python)?;
// report.functions[0].cognitive == 2 (if + else)
# Ok::<(), arborist::ArboristError>(())

Configure thresholds

use arborist::{analyze_file_with_config, AnalysisConfig};

let config = AnalysisConfig {
    cognitive_threshold: Some(8),
    ..Default::default()
};

let report = analyze_file_with_config("src/complex.rs", &config)?;

for func in &report.functions {
    if func.exceeds_threshold == Some(true) {
        eprintln!("WARNING: {} has cognitive complexity {} (threshold: 8)",
            func.name, func.cognitive);
    }
}
# Ok::<(), arborist::ArboristError>(())

Serialize to JSON

let report = arborist::analyze_file("src/main.rs")?;
let json = serde_json::to_string_pretty(&report)?;
println!("{}", json);
# Ok::<(), Box<dyn std::error::Error>>(())

Metrics

Cognitive Complexity

Follows the SonarSource specification by G. Ann Campbell. Measures how difficult code is to understand:

+1 for each control flow break (if, for, while, match, catch, etc.)
Nesting penalty: nested control flow adds the current nesting depth
Boolean operator sequences: one increment per operator switch (&& to ||)
Flat else if: does not increase nesting
+1 for recursive calls and lambda/closure nesting

Cyclomatic Complexity

Standard McCabe cyclomatic complexity (base 1 + decision points). Measures the number of linearly independent paths through a function.

SLOC

Physical source lines of code, excluding blank lines and comment-only lines.

Contributing

Fork the repository
Create a feature branch
Follow TDD: write fixtures and failing tests before implementation
Run cargo clippy -- -D warnings and cargo test --all-features
Submit a pull request

Adding a new language

Create src/languages/<lang>.rs implementing the LanguageProfile trait
Add the grammar crate as an optional dependency in Cargo.toml
Add a feature flag and wire up detection in src/languages/mod.rs
Create 6 test fixtures in tests/fixtures/<lang>/
Write integration tests

License

Licensed under either of:

at your option.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.agent/workflows		.agent/workflows
.claude		.claude
.cursor/rules		.cursor/rules
.devtrail		.devtrail
.gemini/skills		.gemini/skills
.github		.github
.specify		.specify
examples		examples
scripts		scripts
specs/001-code-metrics-library		specs/001-code-metrics-library
src		src
tests		tests
.cursorrules		.cursorrules
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
DEVTRAIL.md		DEVTRAIL.md
GEMINI.md		GEMINI.md
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Arborist

Supported Languages

Feature Flags

Tiers

Individual features

Composite features

Installation

Quick Start

Analyze a file

Analyze source code from memory

Configure thresholds

Serialize to JSON

Metrics

Cognitive Complexity

Cyclomatic Complexity

SLOC

Contributing

Adding a new language

License

About

Licenses found

Uh oh!

Releases 1

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Arborist

Supported Languages

Feature Flags

Tiers

Individual features

Composite features

Installation

Quick Start

Analyze a file

Analyze source code from memory

Configure thresholds

Serialize to JSON

Metrics

Cognitive Complexity

Cyclomatic Complexity

SLOC

Contributing

Adding a new language

License

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 1

Contributors

Uh oh!

Languages