Cosmos Transfer2.5 Auto-Regressive Inference Pipeline by miguelmartin75 · Pull Request #13114 · huggingface/diffusers

miguelmartin75 · 2026-02-10T02:47:36Z

What does this PR do?

This builds off #13066 by adding auto-regressive inference for Cosmos Transfer2.5. This pipeline does not require the controlnet or controls to be input. From the documentation:

The call function can be used in two modes: with or without controls.
When controls are not provided (controls is None), inference works in the same manner as predict2.5 (see
Cosmos2_5_PredictPipeline). This mode strictly uses the base transformer (self.transformer) to perform
inference and accepts as input an optional image or video along with a prompt / negative_prompt. It
can be used in the following ways:
- Text2World: image=None, video=None, prompt provided.
- Image2World: image provided, video=None, prompt provided.
- Video2World: video provided, image=None, prompt provided.
When controls are provided and a ControlNet is attached, the controls drive the conditioning and video and
image are ignored. Controls are assumed to be pre-processed, e.g. edge maps are pre-computed.
Setting num_frames restricts the total number of output frames. If it is not provided or is None
(the default), the number of output frames matches the input video, image or controls respectively.
Auto-regressive inference is supported: a sliding window of num_frames_per_chunk frames is used per
denoising loop. When auto-regressive inference is performed, the last
num_latent_conditional_frames or num_conditional_frames frames of the previous chunk are used to condition
the next denoising loop.
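
To make the sliding-window description above concrete, here is a minimal sketch of how chunk boundaries could be planned in pixel-frame space. It is not the pipeline's implementation: `plan_chunks` is a hypothetical helper, and it assumes each chunk after the first simply overlaps the previous one by `num_conditional_frames` frames and that the last chunk is truncated rather than padded. The real pipeline may work in latent space or handle the final chunk differently.

```python
from typing import List, Tuple


def plan_chunks(
    total_frames: int,
    num_frames_per_chunk: int,
    num_conditional_frames: int,
) -> List[Tuple[int, int]]:
    """Return half-open (start, end) frame ranges for auto-regressive chunks.

    Hypothetical helper: every chunk after the first re-uses the last
    `num_conditional_frames` frames of the previous chunk as conditioning.
    """
    if total_frames <= 0 or num_frames_per_chunk <= 0:
        raise ValueError("total_frames and num_frames_per_chunk must be positive")
    if not 0 <= num_conditional_frames < num_frames_per_chunk:
        raise ValueError("num_conditional_frames must be in [0, num_frames_per_chunk)")

    # Each new chunk contributes this many frames that were not seen before.
    stride = num_frames_per_chunk - num_conditional_frames

    chunks = []
    start = 0
    while True:
        # Assumption: the final chunk is truncated, not padded.
        end = min(start + num_frames_per_chunk, total_frames)
        chunks.append((start, end))
        if end >= total_frames:
            break
        start += stride
    return chunks


if __name__ == "__main__":
    # 10 output frames, windows of 4 frames, 1 frame carried over per window.
    print(plan_chunks(total_frames=10, num_frames_per_chunk=4, num_conditional_frames=1))
```

With these numbers the windows overlap by one frame, so each denoising loop after the first starts from a frame the previous loop already produced.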

Who can review?

Pipelines: @yiyixuxu
Docs: @stevhliu and @sayakpaul

yiyixuxu

thanks for the PR!

my main question is would it make sense to make this pipeline strictly ControlNet-focused? looking at the pipeline code, this would simplify the pipeline quite a bit

yiyixuxu · 2026-02-18T17:57:49Z

src/diffusers/pipelines/cosmos/pipeline_cosmos2_5_transfer.py

        self,
        image: PipelineImageInput | None = None,
        video: List[PipelineImageInput] | None = None,
+        controls: Optional[PipelineImageInput | List[PipelineImageInput]] = None,


maybe we should make this pipeline strictly about controlnet (i.e. not make it optional) and then remove the image and video arguments? this is how other controlnet pipelines behave anyway
if they want to use it without controlnet, they can switch to the base pipeline

yiyixuxu · 2026-02-18T18:32:01Z

src/diffusers/pipelines/cosmos/pipeline_cosmos2_5_transfer.py

            else:
                width = int((height + 16) * (frame.shape[2] / frame.shape[1]))  # NOTE: assuming C H W

+        if num_latent_conditional_frames is not None and num_conditional_frames is not None:


any reason we need two arguments here? is it possible we only keep num_conditional_frames?

miguelmartin75 · Feb 18, 2026

this is done to give the user the option to provide either one. the official GH uses num_conditional_frames for transfer but num_latent_conditional_frames for predict, so I figured it would be best to provide both
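
As an illustration of the two-argument design discussed here, the sketch below resolves either argument to a single latent-frame count. Everything in it is an assumption rather than the PR's code: `resolve_latent_conditional_frames` is a hypothetical helper, raising a `ValueError` when both arguments are set is only a guess at what the check in the diff does, and the pixel-to-latent mapping `(n - 1) // temporal_compression + 1` with a default factor of 4 is the usual convention for causal video VAEs, not something confirmed in this thread.

```python
from typing import Optional


def resolve_latent_conditional_frames(
    num_conditional_frames: Optional[int] = None,
    num_latent_conditional_frames: Optional[int] = None,
    temporal_compression: int = 4,  # assumed VAE temporal factor
) -> int:
    """Return the number of latent frames used for conditioning.

    Hypothetical helper: accepts either a pixel-frame count or a
    latent-frame count, but not both.
    """
    if num_conditional_frames is not None and num_latent_conditional_frames is not None:
        # Assumption: supplying both is treated as a user error.
        raise ValueError(
            "Provide only one of num_conditional_frames or num_latent_conditional_frames."
        )

    if num_latent_conditional_frames is not None:
        if num_latent_conditional_frames < 0:
            raise ValueError("num_latent_conditional_frames must be >= 0")
        return num_latent_conditional_frames

    if num_conditional_frames is None or num_conditional_frames == 0:
        # Nothing to condition on.
        return 0
    if num_conditional_frames < 0:
        raise ValueError("num_conditional_frames must be >= 0")

    # Assumed causal-VAE mapping: the first frame gets its own latent,
    # then every `temporal_compression` frames share one latent.
    return (num_conditional_frames - 1) // temporal_compression + 1
```

Keeping a single internal quantity like this would let the pipeline accept both spellings from the official repository while the denoising loop only ever reasons about latent frames.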

miguelmartin75 force-pushed the cosmos/transfer2.5-ar branch 3 times, most recently from 775f4b8 to 0d0eeae · February 12, 2026 22:19

AR

bff6af9

miguelmartin75 force-pushed the cosmos/transfer2.5-ar branch from 0d0eeae to bff6af9 · February 12, 2026 22:24

miguelmartin75 mentioned this pull request Feb 17, 2026

Cosmos Transfer2.5 inference pipeline: general/{seg, depth, blur, edge} #13066

Merged

Update docs

7f89af8

yiyixuxu reviewed Feb 18, 2026

remove non-control path

16602ad

miguelmartin75 force-pushed the cosmos/transfer2.5-ar branch from 55b09c0 to 16602ad · February 20, 2026 02:43

sayakpaul requested a review from DN6 February 20, 2026 04:36

miguelmartin75 wants to merge 3 commits into huggingface:main from miguelmartin75:cosmos/transfer2.5-ar
