[Feature] Collector.fake_tensordict() / MultiCollector.fake_tensordict() by vmoens · Pull Request #3764 · pytorch/rl

vmoens · 2026-05-15T08:40:14Z

Stack from ghstack (oldest at bottom):

Public method that returns a zero-filled tensordict shaped exactly like
one batch yielded by the collector, useful for storage initialization
and torch.compile / cudagraph warmup without having to step the env
first.

Collector.fake_tensordict() (single-process):

Reuses the existing _final_rollout template; builds it lazily via
_maybe_make_final_rollout(make_rollout=True) even when
use_buffers=False so the public API is consistent.
Mirrors the rollout post-pipeline: _maybe_attach_final_obs,
_maybe_set_truncated, then _postproc (which runs
split_trajectories, the user postproc, and private-key
exclusion).
Result: env keys + policy out-keys + ("collector", "traj_ids"),
compact_obs exclusions and final_obs UnbatchedTensor
leaves applied, last dim named "time".

MultiCollector.fake_tensordict() raises NotImplementedError.
Honoring the contract on the parent process would either require
creating an env there (which defeats the purpose of a multi-process
collector — Isaac Lab / mujoco-mjx etc. can only run in workers) or
routing a request to a worker over the pipe (which requires live
workers and adds protocol surface). Neither is in scope here; users
who need a fake tensordict can call it on a single-process
:class:~torchrl.collectors.Collector.

Tests pin: shape / names / keys / zero-fill parity between
fake_tensordict() and next(iter(collector)) (with and without
buffers); compact_obs drops ("next", obs) and final_obs
attaches ("final", obs) as UnbatchedTensor; and that
MultiCollector.fake_tensordict() raises NotImplementedError.

[ghstack-poisoned]

pytorch-bot · 2026-05-15T08:40:18Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3764

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Run pull request jobs on OSDC runners in shadow mode

❌ 3 New Failures, 1 Cancelled Job

As of commit 7ef328e with merge base 0a01ee8 ():

NEW FAILURES - The following jobs have failed:

Build Windows Wheels / pytorch/rl / build-wheel-py3_10-cpu (gh)
Build Windows Wheels / pytorch/rl / upload / upload-wheel-py3_10-cpu (gh)
Unable to download artifact(s): Artifact not found for name: pytorch_rl__3.10_cpu_x64
Unit-tests on Linux / tests-cpu (3.14) / linux-job (gh)
test/objectives/test_dqn.py::TestQMixer::test_dqn_prioritized_weights

CANCELLED JOB - The following job was cancelled. Please retry:

Unit-tests on Windows / unittests-cpu (3.10, windows.4xlarge, cpu) / windows-job (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Public method that returns a zero-filled tensordict shaped exactly like one batch yielded by the collector, useful for storage initialization and ``torch.compile`` / cudagraph warmup without having to step the env first. ``Collector.fake_tensordict()`` (single-process): - Reuses the existing ``_final_rollout`` template; builds it lazily via ``_maybe_make_final_rollout(make_rollout=True)`` even when ``use_buffers=False`` so the public API is consistent. - Mirrors the rollout post-pipeline: ``_maybe_attach_final_obs``, ``_maybe_set_truncated``, then ``_postproc`` (which runs ``split_trajectories``, the user ``postproc``, and private-key exclusion). - Result: env keys + policy out-keys + ``("collector", "traj_ids")``, ``compact_obs`` exclusions and ``final_obs`` ``UnbatchedTensor`` leaves applied, last dim named ``"time"``. ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. Honoring the contract on the parent process would either require creating an env there (which defeats the purpose of a multi-process collector — Isaac Lab / mujoco-mjx etc. can only run in workers) or routing a request to a worker over the pipe (which requires live workers and adds protocol surface). Neither is in scope here; users who need a fake tensordict can call it on a single-process :class:`~torchrl.collectors.Collector`. Tests pin: shape / names / keys / zero-fill parity between ``fake_tensordict()`` and ``next(iter(collector))`` (with and without buffers); ``compact_obs`` drops ``("next", obs)`` and ``final_obs`` attaches ``("final", obs)`` as ``UnbatchedTensor``; and that ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. ghstack-source-id: c2b5dbb Pull-Request: #3764

[ghstack-poisoned]

Public method that returns a zero-filled tensordict shaped exactly like one batch yielded by the collector, useful for storage initialization and ``torch.compile`` / cudagraph warmup without having to step the env first. ``Collector.fake_tensordict()`` (single-process): - Reuses the existing ``_final_rollout`` template; builds it lazily via ``_maybe_make_final_rollout(make_rollout=True)`` even when ``use_buffers=False`` so the public API is consistent. - Mirrors the rollout post-pipeline: ``_maybe_attach_final_obs``, ``_maybe_set_truncated``, then ``_postproc`` (which runs ``split_trajectories``, the user ``postproc``, and private-key exclusion). - Result: env keys + policy out-keys + ``("collector", "traj_ids")``, ``compact_obs`` exclusions and ``final_obs`` ``UnbatchedTensor`` leaves applied, last dim named ``"time"``. ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. Honoring the contract on the parent process would either require creating an env there (which defeats the purpose of a multi-process collector — Isaac Lab / mujoco-mjx etc. can only run in workers) or routing a request to a worker over the pipe (which requires live workers and adds protocol surface). Neither is in scope here; users who need a fake tensordict can call it on a single-process :class:`~torchrl.collectors.Collector`. Tests pin: shape / names / keys / zero-fill parity between ``fake_tensordict()`` and ``next(iter(collector))`` (with and without buffers); ``compact_obs`` drops ``("next", obs)`` and ``final_obs`` attaches ``("final", obs)`` as ``UnbatchedTensor``; and that ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. ghstack-source-id: 38de918 Pull-Request: #3764

[ghstack-poisoned]

Public method that returns a zero-filled tensordict shaped exactly like one batch yielded by the collector, useful for storage initialization and ``torch.compile`` / cudagraph warmup without having to step the env first. ``Collector.fake_tensordict()`` (single-process): - Reuses the existing ``_final_rollout`` template; builds it lazily via ``_maybe_make_final_rollout(make_rollout=True)`` even when ``use_buffers=False`` so the public API is consistent. - Mirrors the rollout post-pipeline: ``_maybe_attach_final_obs``, ``_maybe_set_truncated``, then ``_postproc`` (which runs ``split_trajectories``, the user ``postproc``, and private-key exclusion). - Result: env keys + policy out-keys + ``("collector", "traj_ids")``, ``compact_obs`` exclusions and ``final_obs`` ``UnbatchedTensor`` leaves applied, last dim named ``"time"``. ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. Honoring the contract on the parent process would either require creating an env there (which defeats the purpose of a multi-process collector — Isaac Lab / mujoco-mjx etc. can only run in workers) or routing a request to a worker over the pipe (which requires live workers and adds protocol surface). Neither is in scope here; users who need a fake tensordict can call it on a single-process :class:`~torchrl.collectors.Collector`. Tests pin: shape / names / keys / zero-fill parity between ``fake_tensordict()`` and ``next(iter(collector))`` (with and without buffers); ``compact_obs`` drops ``("next", obs)`` and ``final_obs`` attaches ``("final", obs)`` as ``UnbatchedTensor``; and that ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. ghstack-source-id: f46dbe7 Pull-Request: #3764

[ghstack-poisoned]

Public method that returns a zero-filled tensordict shaped exactly like one batch yielded by the collector, useful for storage initialization and ``torch.compile`` / cudagraph warmup without having to step the env first. ``Collector.fake_tensordict()`` (single-process): - Reuses the existing ``_final_rollout`` template; builds it lazily via ``_maybe_make_final_rollout(make_rollout=True)`` even when ``use_buffers=False`` so the public API is consistent. - Mirrors the rollout post-pipeline: ``_maybe_attach_final_obs``, ``_maybe_set_truncated``, then ``_postproc`` (which runs ``split_trajectories``, the user ``postproc``, and private-key exclusion). - Result: env keys + policy out-keys + ``("collector", "traj_ids")``, ``compact_obs`` exclusions and ``final_obs`` ``UnbatchedTensor`` leaves applied, last dim named ``"time"``. ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. Honoring the contract on the parent process would either require creating an env there (which defeats the purpose of a multi-process collector — Isaac Lab / mujoco-mjx etc. can only run in workers) or routing a request to a worker over the pipe (which requires live workers and adds protocol surface). Neither is in scope here; users who need a fake tensordict can call it on a single-process :class:`~torchrl.collectors.Collector`. Tests pin: shape / names / keys / zero-fill parity between ``fake_tensordict()`` and ``next(iter(collector))`` (with and without buffers); ``compact_obs`` drops ``("next", obs)`` and ``final_obs`` attaches ``("final", obs)`` as ``UnbatchedTensor``; and that ``MultiCollector.fake_tensordict()`` raises ``NotImplementedError``. ghstack-source-id: 6db6382 Pull-Request: #3764

Update

03b3696

[ghstack-poisoned]

github-actions Bot added Feature New feature Collectors Integrations/torch_geometric Integrations labels May 15, 2026

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 15, 2026

Update

5fa63dd

[ghstack-poisoned]

Update

4265074

[ghstack-poisoned]

Update

79d28c9

[ghstack-poisoned]

This was referenced May 15, 2026

[Refactor] Keep [B, T] dim in value estimators #3767

Merged

[Refactor] Simplify LSTM/GRUModule recurrent-mode shape normalization #3768

Merged

[Example] Add Isaac RNN PPO rollout mode flags #3769

Merged

Update

ec7113e

[ghstack-poisoned]

This was referenced May 17, 2026

[Test] Enable scan compile RNN tests on Windows #3770

Closed

[BugFix] Fix GAE compact path bias on recurrent value nets at internal truncations #3771

Merged

Update

65ace9c

[ghstack-poisoned]

This was referenced May 18, 2026

[Example] Expose compact GAE cat dimension #3775

Merged

[Doc] Migrate shifted=True callers to legacy/compact + docstring polish #3776

Merged

Update

7ef328e

[ghstack-poisoned]

vmoens merged commit 7ef328e into gh/vmoens/280/base May 18, 2026
107 of 113 checks passed

vmoens deleted the gh/vmoens/280/head branch May 18, 2026 21:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Collector.fake_tensordict() / MultiCollector.fake_tensordict()#3764

[Feature] Collector.fake_tensordict() / MultiCollector.fake_tensordict()#3764
vmoens merged 7 commits into
gh/vmoens/280/basefrom
gh/vmoens/280/head

vmoens commented May 15, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented May 15, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

vmoens commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3764

❗ 1 Active SEVs

❌ 3 New Failures, 1 Cancelled Job

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vmoens commented May 15, 2026 •

edited

Loading

pytorch-bot Bot commented May 15, 2026 •

edited

Loading