feat(logger): add rate limiter by kalyazin · Pull Request #5799 · firecracker-microvm/firecracker

kalyazin · 2026-03-27T11:52:09Z

Changes

Add per-callsite rate limiting for guest-triggered logging paths, following the Linux kernel printk_ratelimited pattern. The error_rate_limited! macro gives each callsite its own independent, preconfigured rate limiter set to 10 messages per 5-second window. When messages are suppressed, a summary is emitted once the callsite resumes logging. A new rate_limited_log_count metric tracks total suppressions.

I was not able to build an integration test that demonstrates that the rate limiting is effective against a real end-to-end scenario because it would've required a custom guest kernel, but I ran an ad hoc experiment by inserting an extra error_rate_limited! line into the balloon
inflate descriptor processing loop (hot path) and saw that it was rate-limited from 128 lines to 10 as expected.

Reason

Guest VMs can trigger repeated error!() calls through various virtio device paths (balloon, net, block, PCI, MMIO). Under sustained error conditions, this leads to excessive disk I/O and CPU consumption on the host from synchronous log writes.

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license. For more information on following Developer
Certificate of Origin and signing off your commits, please check
CONTRIBUTING.md.

PR Checklist

This functionality cannot be added in rust-vmm.

codecov · 2026-03-27T11:57:44Z

Codecov Report

❌ Patch coverage is 54.23729% with 27 lines in your changes missing coverage. Please review.
✅ Project coverage is 83.07%. Comparing base (8ddea25) to head (ecae71f).
⚠️ Report is 4 commits behind head on main.

Files with missing lines	Patch %	Lines
src/firecracker/src/main.rs	0.00%	10 Missing ⚠️
src/vmm/src/signal_handler.rs	0.00%	5 Missing ⚠️
src/firecracker/src/api_server_adapter.rs	0.00%	4 Missing ⚠️
src/firecracker/src/metrics.rs	0.00%	4 Missing ⚠️
src/firecracker/src/api_server/mod.rs	83.33%	1 Missing ⚠️
src/firecracker/src/api_server/parsed_request.rs	83.33%	1 Missing ⚠️
src/vmm/src/builder.rs	0.00%	1 Missing ⚠️
src/vmm/src/logger/mod.rs	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5799      +/-   ##
==========================================
+ Coverage   83.04%   83.07%   +0.03%     
==========================================
  Files         275      276       +1     
  Lines       29528    29541      +13     
==========================================
+ Hits        24521    24541      +20     
+ Misses       5007     5000       -7

Flag	Coverage Δ
5.10-m5n.metal	`83.40% <57.14%> (+0.03%)`	⬆️
5.10-m6a.metal	`82.73% <57.14%> (+0.03%)`	⬆️
5.10-m6g.metal	`79.99% <54.23%> (+0.03%)`	⬆️
5.10-m6i.metal	`83.40% <57.14%> (+0.03%)`	⬆️
5.10-m7a.metal-48xl	`82.72% <57.14%> (+0.02%)`	⬆️
5.10-m7g.metal	`79.99% <54.23%> (+0.03%)`	⬆️
5.10-m7i.metal-24xl	`83.37% <57.14%> (+0.02%)`	⬆️
5.10-m7i.metal-48xl	`83.37% <57.14%> (+0.02%)`	⬆️
5.10-m8g.metal-24xl	`79.99% <54.23%> (+0.03%)`	⬆️
5.10-m8g.metal-48xl	`79.99% <54.23%> (+0.03%)`	⬆️
5.10-m8i.metal-48xl	`83.37% <57.14%> (+0.02%)`	⬆️
5.10-m8i.metal-96xl	`83.37% <57.14%> (+0.02%)`	⬆️
6.1-m5n.metal	`83.42% <57.14%> (+0.02%)`	⬆️
6.1-m6a.metal	`82.75% <57.14%> (+0.02%)`	⬆️
6.1-m6g.metal	`79.99% <54.23%> (+0.02%)`	⬆️
6.1-m6i.metal	`83.42% <57.14%> (+0.02%)`	⬆️
6.1-m7a.metal-48xl	`82.75% <57.14%> (+0.02%)`	⬆️
6.1-m7g.metal	`79.99% <54.23%> (+0.03%)`	⬆️
6.1-m7i.metal-24xl	`83.43% <57.14%> (+0.02%)`	⬆️
6.1-m7i.metal-48xl	`83.43% <57.14%> (+0.02%)`	⬆️
6.1-m8g.metal-24xl	`79.99% <54.23%> (+0.03%)`	⬆️
6.1-m8g.metal-48xl	`79.99% <54.23%> (+0.03%)`	⬆️
6.1-m8i.metal-48xl	`83.43% <57.14%> (+0.02%)`	⬆️
6.1-m8i.metal-96xl	`83.43% <57.14%> (+0.02%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Manciukic

one more small issue but overall LGTM!

Add a per-callsite rate limiter for logging that wraps the existing TokenBucket in OnceLock<Mutex<...>>. Each macro invocation site gets its own independent LogRateLimiter via a static, so flooding one callsite does not suppress unrelated log messages. Default configuration: 10 messages per 5-second refill period, matching the Linux kernel printk_ratelimited defaults. Include unit tests for burst enforcement, callsite independence, and token refill after the configured period. Signed-off-by: Nikita Kalyazin <kalyazin@amazon.com>

Redefine the error, warn, and info macros re-exported from crate::logger to include per-callsite rate limiting. The original unrestricted log macros are available as error_unrestricted, warn_unrestricted, and info_unrestricted for callsites that must not be rate limited. Each macro checks log_enabled before touching the rate limiter to avoid overhead for filtered-out log levels. Per-callsite suppression counting via a static AtomicU64 reports the number of suppressed messages at warn level when logging resumes. Add rate_limited_log_count metric to LoggerSystemMetrics and update fcmetrics.py accordingly. Signed-off-by: Nikita Kalyazin <kalyazin@amazon.com>

Add clippy.toml to the vmm crate with disallowed-macros configuration that prevents direct use of log::error, log::warn, log::info, and log::debug. This ensures all log callsites go through the crate::logger wrappers rather than calling log macros directly. The rate-limited and unrestricted macro implementations use allow(clippy::disallowed_macros) internally since they must call the underlying log macros. Signed-off-by: Nikita Kalyazin <kalyazin@amazon.com>

Document the new per-callsite rate-limited logging feature in the changelog. Signed-off-by: Nikita Kalyazin <kalyazin@amazon.com>

ShadowCurse · 2026-04-14T15:36:28Z

Resolved 5 lines of conflicts, but GH shows the whole PR in the compare section for some reason.

The per-callsite rate-limited logging feature [1] added a static LogRateLimiter (OnceLock<Mutex<TokenBucket>>) and a static AtomicU64 at each error!/warn!/info! callsite. With ~339 callsites in the VMM crate, this increases the RSS footprint of the Firecracker process. Under the maximum configuration (32 vCPUs, PCI enabled, snapshot creation), the memory monitor observed 6.07-6.13 MiB, exceeding the previous 6.0 MiB threshold_snapshot and causing test_all_vcpus_online to fail in the uvm_restored path. Bump threshold_snapshot from 6 MiB to 7 MiB to accommodate the additional static memory from rate-limited logging. [1]: firecracker-microvm#5799 Signed-off-by: Takahiro Itazuri <itazur@amazon.com>

kalyazin force-pushed the log_rate_limiter branch from 2325c61 to 3ddd1f5 Compare March 27, 2026 11:52

kalyazin force-pushed the log_rate_limiter branch 2 times, most recently from 0240225 to eb60521 Compare March 27, 2026 12:33

kalyazin marked this pull request as ready for review March 27, 2026 12:33

kalyazin requested review from Manciukic and pb8o as code owners March 27, 2026 12:33

kalyazin self-assigned this Mar 27, 2026

kalyazin added the Status: Awaiting review Indicates that a pull request is ready to be reviewed label Mar 27, 2026

ShadowCurse reviewed Mar 27, 2026

View reviewed changes

Comment thread src/vmm/src/devices/virtio/balloon/device.rs Outdated

Comment thread src/vmm/src/logger/rate_limited.rs Outdated

ShadowCurse reviewed Mar 27, 2026

View reviewed changes

Comment thread src/vmm/src/logger/rate_limited.rs Outdated

Manciukic reviewed Mar 27, 2026

View reviewed changes

Comment thread src/vmm/src/logger/rate_limited.rs

Manciukic reviewed Mar 27, 2026

View reviewed changes

Comment thread src/vmm/src/logger/rate_limited.rs

kalyazin force-pushed the log_rate_limiter branch 3 times, most recently from 531998b to 80580f3 Compare March 30, 2026 14:40

ShadowCurse reviewed Mar 30, 2026

View reviewed changes

Comment thread src/vmm/src/logger/mod.rs Outdated

ShadowCurse reviewed Mar 30, 2026

View reviewed changes

Comment thread CHANGELOG.md Outdated

kalyazin force-pushed the log_rate_limiter branch from 80580f3 to b795a7b Compare March 30, 2026 15:47

Manciukic reviewed Mar 30, 2026

View reviewed changes

Comment thread src/vmm/src/logger/mod.rs

Comment thread src/vmm/src/logger/rate_limited.rs

ilstam reviewed Apr 1, 2026

View reviewed changes

Comment thread src/vmm/src/logger/rate_limited.rs

kalyazin force-pushed the log_rate_limiter branch 2 times, most recently from 0514643 to d5835aa Compare April 2, 2026 11:49

Manciukic reviewed Apr 2, 2026

View reviewed changes

Comment thread src/vmm/src/arch/aarch64/vcpu.rs Outdated

Comment thread src/vmm/src/logger/mod.rs

ShadowCurse reviewed Apr 2, 2026

View reviewed changes

Comment thread src/vmm/src/logger/rate_limited.rs Outdated

Comment thread src/vmm/src/logger/mod.rs

Comment thread src/vmm/src/arch/aarch64/vcpu.rs Outdated

kalyazin force-pushed the log_rate_limiter branch 5 times, most recently from 0e11369 to 18d9c30 Compare April 8, 2026 09:25

kalyazin force-pushed the log_rate_limiter branch 5 times, most recently from ade2a4d to 978f04d Compare April 13, 2026 13:08

kalyazin requested review from Manciukic and ShadowCurse April 13, 2026 13:39

ShadowCurse reviewed Apr 13, 2026

View reviewed changes

Comment thread src/vmm/src/acpi/mod.rs Outdated

kalyazin force-pushed the log_rate_limiter branch from 978f04d to 1a02189 Compare April 13, 2026 14:33

ShadowCurse previously approved these changes Apr 13, 2026

View reviewed changes

Manciukic reviewed Apr 13, 2026

View reviewed changes

Comment thread src/vmm/src/signal_handler.rs

kalyazin dismissed ShadowCurse’s stale review via 1f0602b April 13, 2026 15:51

kalyazin force-pushed the log_rate_limiter branch from 1a02189 to 1f0602b Compare April 13, 2026 15:51

ShadowCurse reviewed Apr 13, 2026

View reviewed changes

Comment thread src/vmm/src/logger/mod.rs

ShadowCurse previously approved these changes Apr 13, 2026

View reviewed changes

Manciukic previously approved these changes Apr 14, 2026

View reviewed changes

ShadowCurse enabled auto-merge (rebase) April 14, 2026 15:03

kalyazin added 4 commits April 14, 2026 16:29

changelog: add entry for per-callsite rate-limited logging

ecae71f

Document the new per-callsite rate-limited logging feature in the changelog. Signed-off-by: Nikita Kalyazin <kalyazin@amazon.com>

ShadowCurse dismissed stale reviews from Manciukic and themself via ecae71f April 14, 2026 15:33

ShadowCurse force-pushed the log_rate_limiter branch from 1f0602b to ecae71f Compare April 14, 2026 15:33

ShadowCurse approved these changes Apr 14, 2026

View reviewed changes

Manciukic approved these changes Apr 14, 2026

View reviewed changes

ShadowCurse merged commit a481d51 into firecracker-microvm:main Apr 14, 2026
6 of 7 checks passed

zulinx86 mentioned this pull request Apr 16, 2026

chore: Increase snapshot memory threshold from 6 to 7 MiB #5843

Open

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(logger): add rate limiter#5799

feat(logger): add rate limiter#5799
ShadowCurse merged 4 commits intofirecracker-microvm:mainfrom
kalyazin:log_rate_limiter

kalyazin commented Mar 27, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Manciukic left a comment

Uh oh!

Uh oh!

Uh oh!

ShadowCurse commented Apr 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

kalyazin commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Reason

License Acceptance

PR Checklist

Uh oh!

codecov bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Manciukic left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ShadowCurse commented Apr 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kalyazin commented Mar 27, 2026 •

edited

Loading

codecov bot commented Mar 27, 2026 •

edited

Loading