Add --batch-size flag to CLI #2

@pipe1os

Description

The calculate_footprint function already supports a batch_size argument to scale dynamic KV cache calculations, but there is no CLI flag to expose this.

We need to add a --batch-size argument to cli.py (default: 1) and pass it through analyze_model to calculate_footprint.
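A minimal sketch of how the flag could be wired, assuming `cli.py` uses `argparse`. The bodies of `analyze_model` and `calculate_footprint` below are hypothetical stand-ins (the real signatures in this repo may differ), and the positional `model` argument and the `positive_int` validator are assumptions added for illustration only.

```python
import argparse


def calculate_footprint(model_id: str, batch_size: int = 1) -> dict:
    """Hypothetical stand-in: the real function computes the VRAM footprint."""
    return {"model": model_id, "batch_size": batch_size}


def analyze_model(model_id: str, batch_size: int = 1) -> dict:
    """Hypothetical stand-in: forwards batch_size unchanged."""
    return calculate_footprint(model_id, batch_size=batch_size)


def positive_int(value: str) -> int:
    """argparse type that rejects zero and negative batch sizes (assumed policy)."""
    number = int(value)
    if number < 1:
        raise argparse.ArgumentTypeError(f"must be >= 1, got {number}")
    return number


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Estimate model VRAM footprint.")
    # The positional argument name is an assumption, not taken from the repo.
    parser.add_argument("model", help="model identifier")
    parser.add_argument(
        "--batch-size",
        type=positive_int,
        default=1,
        help="batch size used to scale the KV cache estimate (default: 1)",
    )
    return parser


def main(argv=None) -> dict:
    args = build_parser().parse_args(argv)
    # argparse exposes --batch-size as args.batch_size.
    return analyze_model(args.model, batch_size=args.batch_size)
```

With this shape, omitting the flag keeps today's behavior (`batch_size=1`), so existing invocations are unaffected.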

Use Case

Lets users calculate accurate VRAM footprints for batched inference (batch size greater than 1) without editing code.
