fix: process batch RPC request in parallel by hanabi1224 · Pull Request #7093 · ChainSafe/forest

hanabi1224 · 2026-05-21T16:31:08Z

Summary of changes

Changes introduced in this pull request:

Reference issue to close (if applicable)

Closes #7092

Change checklist

I have performed a self-review of my own code,
I have made corresponding changes to the documentation. All new code adheres to the team's documentation standards,
I have added tests that prove my fix is effective or that my feature works (if possible),
I have made sure the CHANGELOG is up-to-date. All user-facing changes should be reflected in this document.

Outside contributions

I have read and agree to the CONTRIBUTING document.
I have read and agree to the AI Policy document. I understand that failure to comply with the guidelines will lead to rejection of the pull request.

Summary by CodeRabbit

New Features
- RPC batch requests are processed in parallel to improve throughput while preserving response ordering.
Refactor
- Centralized RPC server configuration to enforce request/response size and connection limits and expose a single max-response size for batch handling.
- RPC client type and constructor made public for external use.
Documentation
- Added an environment variable to control batch concurrency with a default value.
Chores
- Changelog updated to note parallelized RPC batch processing.

coderabbitai · 2026-05-21T16:31:16Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 13c88f7d-193e-4d55-b15a-ce9fadbc46a2

📥 Commits

Reviewing files that changed from the base of the PR and between 5628ce0 and c86a330.

📒 Files selected for processing (2)

src/rpc/client.rs
src/rpc/mod.rs

Walkthrough

Adds ParallelBatchLayer middleware that executes JSON-RPC batch entries concurrently using ordered futures and a size-limited BatchResponseBuilder, exposes a max_response_body_size configuration, and wires the layer into the RPC server stack after MetricsLayer using a shared ServerConfig.

Changes

Parallel Batch Processing

Layer / File(s)	Summary
ParallelBatchLayer type and service definition `src/rpc/parallel_batch_layer.rs`	`ParallelBatchLayer` and `ParallelBatchService<S>` carry `max_response_body_size` configuration, wrap the inner service in an `Arc`, and implement `RpcServiceT` with passthroughs for `call` and `notification`.
Batch concurrent execution logic `src/rpc/parallel_batch_layer.rs`	`RpcServiceT::batch` schedules batch entries into an ordered futures queue, limits concurrency via a semaphore from `FOREST_RPC_BATCH_MAX_CONCURRENCY`, maps per-entry outcomes into `MethodResponse` (or `None` for notifications), appends call results to a size-limited `BatchResponseBuilder` with early return on append failure, and returns either an empty notification-only response or `MethodResponse::from_batch(...)`.
RPC server integration and middleware wiring `src/rpc/mod.rs`, `src/rpc/client.rs`, `CHANGELOG.md`, `docs/docs/users/reference/env_variables.md`	Adds `parallel_batch_layer` module and import; extracts a reusable `ServerConfig`, derives `max_response_body_size`, inserts `ParallelBatchLayer::new(max_response_body_size)` immediately after `MetricsLayer` as the final middleware, makes `UrlClient` and its constructor public, extends RPC tests to send and validate an HTTP batch request, updates changelog, and documents `FOREST_RPC_BATCH_MAX_CONCURRENCY`.

Sequence Diagram

sequenceDiagram
  participant Client
  participant ParallelBatchService
  participant InnerService
  participant BatchResponseBuilder
  Client->>ParallelBatchService: send batch (array of entries)
  ParallelBatchService->>InnerService: spawn per-entry tasks (bounded by semaphore)
  InnerService-->>ParallelBatchService: per-entry MethodResponse | None (notification)
  ParallelBatchService->>BatchResponseBuilder: append call responses in original order
  ParallelBatchService-->>Client: return batch response or notification-only response

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

ChainSafe/forest#6206: Modifies JSON-RPC batch request handling in middleware; related to batch processing behavior and middleware changes.
ChainSafe/forest#6528: Changes JSON-RPC ServerConfig/max connections plumbing; touches the same server config area adjusted here.
ChainSafe/forest#7025: Touches JSON-RPC max_response_body_size configuration which is reused to parameterize ParallelBatchLayer.

Suggested reviewers

LesnyRumcajs
sudo-shashank
akaladarshi

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and concisely summarizes the primary change: implementing parallel processing for JSON-RPC batch requests, matching the main objective from issue `#7092`.
Linked Issues check	✅ Passed	The PR successfully implements all coding objectives from issue `#7092`: batch requests are parallelized via ParallelBatchLayer middleware, concurrency is configurable via FOREST_RPC_BATCH_MAX_CONCURRENCY environment variable, and middleware-based implementation avoids forking jsonrpsee.
Out of Scope Changes check	✅ Passed	All changes are directly related to implementing batch RPC parallelization: new middleware layer, module integration, environment variable documentation, UrlClient visibility for testing, and changelog updates are all in scope.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch hm/rpc-process-batch-in-parallel

✨ Simplify code

Create PR with simplified code
Commit simplified code in branch hm/rpc-process-batch-in-parallel

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

LesnyRumcajs · 2026-05-21T16:39:26Z

+};
+use tower::Layer;
+
+// State-less jsonrpcsee layer for measuring RPC metrics


coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/rpc/parallel_batch_layer.rs`:
- Around line 58-85: The loop in parallel_batch_layer::batch schedules every
BatchEntry immediately via tasks (FuturesOrdered) which can create unbounded
in-flight work; add a configurable concurrency cap (e.g., a semaphore or a
bounded buffer) referenced from the struct (introduce a field like
concurrency_limit or semaphore) and acquire a permit before spawning each task
for self.service.call(...) or service.notification(...), releasing the permit
when the spawned future completes, so you still push the task into tasks
(FuturesOrdered) for response assembly but never exceed the configured in-flight
count; ensure Err branch (MethodResponse::error(...)) still gets pushed
immediately without consuming a permit if you prefer, or respect the same cap to
be consistent.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 0f6fea72-6328-4972-8676-b80d11aa4797

📥 Commits

Reviewing files that changed from the base of the PR and between 62c0289 and 51d0b77.

📒 Files selected for processing (2)

src/rpc/mod.rs
src/rpc/parallel_batch_layer.rs

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/rpc/parallel_batch_layer.rs (1)
85-123: ⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Avoid returning on batch_rp.append(...) error while tasks (FuturesOrdered) still owns in-flight RPC futures.
The early return err drops tasks, and dropping a FuturesOrdered cancels/drops all futures it still manages; since call/notification futures are running under the concurrency semaphore, some entries may be aborted mid-flight and only the overflow error is returned to the client. Drain/poll the remaining tasks until empty (ignoring/short-circuiting further appends as needed) before returning the error.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/rpc/parallel_batch_layer.rs` around lines 85 - 123, The loop currently
returns early when batch_rp.append(...) fails, which drops tasks (the
FuturesOrdered) and cancels in-flight call/notification futures; instead,
capture the append error into a local Option<ErrType> (or a flag) when
MethodResponse append fails, but do not return immediately—continue
polling/draining tasks (the variable tasks) until it is exhausted so
semaphore-backed futures (service.call/service.notification) can complete or be
polled to completion, and after the while loop returns the captured error if
any; update the async block containing tasks.next().await to implement this
drain-and-lazy-error-return behavior while still creating the error
MethodResponse via MethodResponse::error(...) for Err(BatchEntry) cases.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/rpc/parallel_batch_layer.rs`:
- Around line 67-80: The semaphore is being created inside batch() so each
request gets a fresh limiter; move the limiter out to be a shared field on the
long-lived type (e.g., add a field like semaphore: Arc<Semaphore> on
ParallelBatchLayer or ParallelBatchService and initialize it once using
MAX_CONCURRENCY / MAX_CONCURRENCY_ENV during construction) and then clone that
Arc inside batch() instead of creating a new Semaphore there; alternatively, if
the intent is per-batch limiting, rename the local variable and
MAX_CONCURRENCY_ENV to make that explicit (e.g., per-batch max) so callers know
it is not global.

---

Outside diff comments:
In `@src/rpc/parallel_batch_layer.rs`:
- Around line 85-123: The loop currently returns early when batch_rp.append(...)
fails, which drops tasks (the FuturesOrdered) and cancels in-flight
call/notification futures; instead, capture the append error into a local
Option<ErrType> (or a flag) when MethodResponse append fails, but do not return
immediately—continue polling/draining tasks (the variable tasks) until it is
exhausted so semaphore-backed futures (service.call/service.notification) can
complete or be polled to completion, and after the while loop returns the
captured error if any; update the async block containing tasks.next().await to
implement this drain-and-lazy-error-return behavior while still creating the
error MethodResponse via MethodResponse::error(...) for Err(BatchEntry) cases.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: a192deef-805b-4c77-97cf-25c1bd52f98b

📥 Commits

Reviewing files that changed from the base of the PR and between df7758b and 5628ce0.

📒 Files selected for processing (3)

CHANGELOG.md
docs/docs/users/reference/env_variables.md
src/rpc/parallel_batch_layer.rs

✅ Files skipped from review due to trivial changes (2)

docs/docs/users/reference/env_variables.md
CHANGELOG.md

coderabbitai · 2026-05-21T17:24:14Z

+        const MAX_CONCURRENCY_ENV: &str = "FOREST_RPC_BATCH_MAX_CONCURRENCY";
+        static MAX_CONCURRENCY: LazyLock<NonZeroUsize> = LazyLock::new(|| {
+            std::env::var(MAX_CONCURRENCY_ENV)
+                .ok()
+                .and_then(|i| i.parse().ok())
+                .inspect(|i| {
+                    tracing::info!(
+                        "Max RPC batch concurrency is set to {i} by {MAX_CONCURRENCY_ENV}"
+                    )
+                })
+                .unwrap_or(nonzero!(8usize))
+        });
+        let semaphore = Arc::new(Semaphore::new(MAX_CONCURRENCY.get()));
+        let mut batch_rp = BatchResponseBuilder::new_with_limit(self.max_response_body_size);


⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Share the limiter across batch invocations, or label it as per-batch.

Each batch() call creates a fresh Semaphore, so FOREST_RPC_BATCH_MAX_CONCURRENCY=8 still allows 8 × concurrent_batches in-flight RPCs. If this knob is meant to protect the node globally, it needs to live on ParallelBatchLayer/ParallelBatchService and be cloned, not rebuilt per request.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/rpc/parallel_batch_layer.rs` around lines 67 - 80, The semaphore is being created inside batch() so each request gets a fresh limiter; move the limiter out to be a shared field on the long-lived type (e.g., add a field like semaphore: Arc<Semaphore> on ParallelBatchLayer or ParallelBatchService and initialize it once using MAX_CONCURRENCY / MAX_CONCURRENCY_ENV during construction) and then clone that Arc inside batch() instead of creating a new Semaphore there; alternatively, if the intent is per-batch limiting, rename the local variable and MAX_CONCURRENCY_ENV to make that explicit (e.g., per-batch max) so callers know it is not global.

This is intended for now. Will make the change if testing proves we do need a global Semaphore

@hanabi1224, understood — the per-batch concurrency limit is intentional for now, acting as an intra-batch throttle rather than a global one. That's a reasonable starting point. Would you like me to open a follow-up issue to track the potential migration to a shared semaphore once you have performance data?

(°ᴥ° )～

✏️ Learnings added

Learnt from: hanabi1224 Repo: ChainSafe/forest PR: 7093 File: src/rpc/parallel_batch_layer.rs:67-80 Timestamp: 2026-05-21T17:26:37.270Z Learning: In `src/rpc/parallel_batch_layer.rs` (ChainSafe/forest), the `Semaphore` inside `ParallelBatchService::batch()` is intentionally created per-batch call to limit concurrency within a single batch. It is NOT intended to be a global limiter across all concurrent batches. The `FOREST_RPC_BATCH_MAX_CONCURRENCY` environment variable (default 8) therefore controls per-batch parallelism, not node-wide RPC concurrency. A migration to a shared/global semaphore is deferred until testing proves it necessary.

Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.

🧠 Learnings used

Learnt from: hanabi1224 Repo: ChainSafe/forest PR: 6381 File: src/lotus_json/actors/states/cron_state.rs:8-8 Timestamp: 2026-01-05T12:54:40.850Z Learning: In Rust code reviews, do not derive Eq for a struct if any field does not implement Eq (e.g., types from external dependencies). If a type like CronStateLotusJson includes fields wrapping external dependencies that lack Eq, derive PartialEq (or implement PartialEq manually) but avoid deriving Eq. This ensures comparisons compile and reflect actual equivalence semantics. When needed, consider implementing custom PartialEq (and possibly Eq) only after ensuring all fields (or wrappers) implement Eq, or keep PartialEq-only if full equality semantics cannot be expressed.

Learnt from: hanabi1224 Repo: ChainSafe/forest PR: 6381 File: src/lotus_json/actors/states/evm_state.rs:41-44 Timestamp: 2026-01-05T12:56:13.802Z Learning: In Rust codebases (e.g., Forest), do not add #[cfg(test)] to functions already annotated with #[test]. The #[test] attribute ensures the function is compiled only for tests, so a separate #[cfg(test)] is redundant and can be removed if present. Apply this check to all Rust files that contain #[test] functions.

Learnt from: hanabi1224 Repo: ChainSafe/forest PR: 6666 File: src/tool/subcommands/archive_cmd.rs:628-631 Timestamp: 2026-03-02T09:43:34.946Z Learning: In Rust sources related to Forest snapshot handling, implement the rule: enable message_receipts and events (message_receipts: true, events: true) only for GC snapshots as defined in src/db/gc/snapshot.rs, since these are internal snapshots created during garbage collection. For user-facing export commands such as src/tool/subcommands/archive_cmd.rs, disable receipts and events by default (message_receipts: false, events: false) to keep user-facing snapshots smaller, unless explicitly requested. This guidance targets Rust files; apply consistently across similar snapshot-related modules, using the narrowest applicable scope when extending beyond the two specified files.

Learnt from: LesnyRumcajs Repo: ChainSafe/forest PR: 6903 File: src/shim/executor.rs:243-251 Timestamp: 2026-04-14T17:24:13.631Z Learning: In the ChainSafe/forest (LesnyRumcajs) codebase, do not require rustdoc comments on public methods when the method’s purpose is clearly self-explanatory from its name (e.g., `entries(&self)`, `into_entries(self)`). When reviewing Rust (`.rs`) files, avoid flagging missing documentation for these obviously named public APIs, reserving doc requirements for less clear or non-obvious public methods.

Learnt from: hanabi1224 Repo: ChainSafe/forest PR: 7077 File: src/utils/cache/size_tracking.rs:174-182 Timestamp: 2026-05-19T10:55:04.684Z Learning: In ChainSafe/forest code reviews, avoid raising review comments for issues that are likely compile-time failures (e.g., missing imports, unresolved types, or other compiler errors that CI will catch). Instead, focus feedback on logic, correctness, and design/maintainability concerns, since the CI pipeline reliably verifies compilation.

codecov · 2026-05-21T17:32:40Z

Codecov Report

❌ Patch coverage is 72.72727% with 27 lines in your changes missing coverage. Please review.
✅ Project coverage is 64.25%. Comparing base (62c0289) to head (c86a330).
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
src/rpc/parallel_batch_layer.rs	59.70%	23 Missing and 4 partials ⚠️

Additional details and impacted files

Files with missing lines	Coverage Δ
src/rpc/client.rs	`53.14% <100.00%> (+6.76%)`	⬆️
src/rpc/mod.rs	`88.92% <100.00%> (+0.83%)`	⬆️
src/rpc/parallel_batch_layer.rs	`59.70% <59.70%> (ø)`

... and 10 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 62c0289...c86a330. Read the comment docs.

🚀 New features to boost your workflow:

📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

hanabi1224 added the RPC requires calibnet RPC checks to run on CI label May 21, 2026

hanabi1224 marked this pull request as ready for review May 21, 2026 16:35

hanabi1224 requested a review from a team as a code owner May 21, 2026 16:35

hanabi1224 requested review from LesnyRumcajs and sudo-shashank and removed request for a team May 21, 2026 16:35

LesnyRumcajs reviewed May 21, 2026

View reviewed changes

coderabbitai Bot reviewed May 21, 2026

View reviewed changes

Comment thread src/rpc/parallel_batch_layer.rs

hanabi1224 force-pushed the hm/rpc-process-batch-in-parallel branch from 51d0b77 to e9030aa Compare May 21, 2026 16:43

fix: process batch RPC request in parallel

df7758b

hanabi1224 force-pushed the hm/rpc-process-batch-in-parallel branch from e9030aa to df7758b Compare May 21, 2026 16:53

hanabi1224 added 2 commits May 22, 2026 01:09

FOREST_RPC_BATCH_MAX_CONCURRENCY

bf427df

changelog

5628ce0

github-actions Bot deployed to preview May 21, 2026 17:11 View deployment

github-actions Bot deployed to preview May 21, 2026 17:14 View deployment

hanabi1224 requested a review from LesnyRumcajs May 21, 2026 17:14

coderabbitai Bot reviewed May 21, 2026

View reviewed changes

cover batch in unit test

c86a330

github-actions Bot deployed to preview May 21, 2026 17:58 View deployment

sudo-shashank approved these changes May 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: process batch RPC request in parallel#7093

fix: process batch RPC request in parallel#7093
hanabi1224 wants to merge 4 commits into
mainfrom
hm/rpc-process-batch-in-parallel

hanabi1224 commented May 21, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 21, 2026 •

edited

Loading

Reviews paused

❌ Failed checks (1 warning)

Uh oh!

LesnyRumcajs May 21, 2026

Uh oh!

hanabi1224 May 21, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot May 21, 2026 •

edited

Loading

Uh oh!

hanabi1224 May 21, 2026

Uh oh!

coderabbitai Bot May 21, 2026

Uh oh!

codecov Bot commented May 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hanabi1224 commented May 21, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary of changes

Reference issue to close (if applicable)

Other information and links

Change checklist

Outside contributions

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested reviewers

❌ Failed checks (1 warning)

Uh oh!

LesnyRumcajs May 21, 2026

Choose a reason for hiding this comment

Uh oh!

hanabi1224 May 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hanabi1224 May 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hanabi1224 commented May 21, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 21, 2026 •

edited

Loading

coderabbitai Bot May 21, 2026 •

edited

Loading

codecov Bot commented May 21, 2026 •

edited

Loading