support want2v lora train by gushiqiao · Pull Request #1148 · ModelTC/LightX2V

gushiqiao · 2026-06-12T08:32:11Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces support for WanT2V training and inference, adding new datasets (WanT2VVideoDataset, WanT2VCachedDataset, PromptDataset), a WanT2VInferencer, a WanT2VModel, and dynamic time-shifting scheduling. Feedback focuses on improving robustness and performance: handling empty video paths and directory checks in dataset resolution, wrapping metadata parsing in try-except blocks, allowing configurable batch sizes, caching VAE latent statistics to avoid redundant tensor creation, and preventing a potential division-by-zero error in the dynamic time-shift scheduler.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-12T08:34:20Z

+    def _maybe_add_target_hw(self, sample, height, width):
+        if height in (None, "") or width in (None, ""):
+            return
+        sample["target_height"] = int(height)
+        sample["target_width"] = int(width)


Wrap the integer parsing of height and width in a try-except block to prevent the dataset loading from crashing due to malformed metadata.

Suggested change

def _maybe_add_target_hw(self, sample, height, width):

if height in (None, "") or width in (None, ""):

return

sample["target_height"] = int(height)

sample["target_width"] = int(width)

def _maybe_add_target_hw(self, sample, height, width):

if height in (None, "") or width in (None, ""):

return

try:

sample["target_height"] = int(height)

sample["target_width"] = int(width)

except ValueError:

logger.warning("Invalid height/width values: height={}, width={}", height, width)

gemini-code-assist · 2026-06-12T08:34:20Z

+    return DataLoader(
+        dataset,
+        batch_size=1,
+        shuffle=shuffle if sampler is None else False,
+        sampler=sampler,
+        num_workers=data_config.get("num_workers", 8),
+        pin_memory=data_config.get("pin_memory", True),
+    )


Allow the batch_size to be configured from data_config instead of hardcoding it to 1.

Suggested change

return DataLoader(

dataset,

batch_size=1,

shuffle=shuffle if sampler is None else False,

sampler=sampler,

num_workers=data_config.get("num_workers", 8),

pin_memory=data_config.get("pin_memory", True),

)

return DataLoader(

dataset,

batch_size=data_config.get("batch_size", 1),

shuffle=shuffle if sampler is None else False,

sampler=sampler,

num_workers=data_config.get("num_workers", 8),

pin_memory=data_config.get("pin_memory", True),

)

gemini-code-assist · 2026-06-12T08:34:20Z

+        if self.load_vae:
+            self.vae = AutoencoderKLWan.from_pretrained(
+                model_path,
+                subfolder="vae",
+                torch_dtype=self.vae_dtype,
+            ).to(self.device)
+            self.vae.requires_grad_(False)
+            if model_config.get("enable_vae_tiling", False):
+                self.vae.enable_tiling()


Pre-compute and cache latent_mean and latent_std on the target device during initialization to avoid recreating these tensors on every training step.

Suggested change

if self.load_vae:

self.vae = AutoencoderKLWan.from_pretrained(

model_path,

subfolder="vae",

torch_dtype=self.vae_dtype,

).to(self.device)

self.vae.requires_grad_(False)

if model_config.get("enable_vae_tiling", False):

self.vae.enable_tiling()

if self.load_vae:

self.vae = AutoencoderKLWan.from_pretrained(

model_path,

subfolder="vae",

torch_dtype=self.vae_dtype,

).to(self.device)

self.vae.requires_grad_(False)

if model_config.get("enable_vae_tiling", False):

self.vae.enable_tiling()

self.latent_mean = torch.tensor(self.vae.config.latents_mean, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)

self.latent_std = 1.0 / torch.tensor(self.vae.config.latents_std, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)

gemini-code-assist · 2026-06-12T08:34:20Z

+        latent_mean = torch.tensor(self.vae.config.latents_mean, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)
+        latent_std = 1.0 / torch.tensor(self.vae.config.latents_std, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)
+        latent = (latent - latent_mean) * latent_std


Use the cached self.latent_mean and self.latent_std attributes instead of recreating them on every call.

Suggested change

latent_mean = torch.tensor(self.vae.config.latents_mean, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)

latent_std = 1.0 / torch.tensor(self.vae.config.latents_std, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)

latent = (latent - latent_mean) * latent_std

latent = (latent - self.latent_mean) * self.latent_std

gemini-code-assist · 2026-06-12T08:34:20Z

+        latents_mean = torch.tensor(self.vae.config.latents_mean, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)
+        latents_std = 1.0 / torch.tensor(self.vae.config.latents_std, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)
+        latent = latent.to(dtype=self.vae_dtype) / latents_std + latents_mean


Use the cached self.latent_mean and self.latent_std attributes instead of recreating them on every call.

Suggested change

latents_mean = torch.tensor(self.vae.config.latents_mean, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)

latents_std = 1.0 / torch.tensor(self.vae.config.latents_std, device=self.device, dtype=self.vae_dtype).view(1, self.vae.config.z_dim, 1, 1, 1)

latent = latent.to(dtype=self.vae_dtype) / latents_std + latents_mean

latent = latent.to(dtype=self.vae_dtype) / self.latent_std + self.latent_mean

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

support want2v lora train

59bc946

gemini-code-assist Bot reviewed Jun 12, 2026

View reviewed changes

gushiqiao and others added 3 commits June 12, 2026 16:41

Update lightx2v_train/lightx2v_train/schedulers/time_shift.py

19ee2f0

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update lightx2v_train/lightx2v_train/data/video_dataset.py

3a215b7

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update lightx2v_train/lightx2v_train/data/video_dataset.py

7aa6fef

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support want2v lora train#1148

support want2v lora train#1148
gushiqiao wants to merge 4 commits into
mainfrom
gsq/dev-train

gushiqiao commented Jun 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gushiqiao commented Jun 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant