Pull requests: NovaSky-AI/SkyRL
fix(docker): optimize Dockerfile.megatron to reduce image size by 1.21 GB
run_train_megatron_gpu_ci
#1499
opened Apr 11, 2026 by
dinhxuanvu
[train][multimodal][3/3] Trainer changes to extract multi-modal outputs from GeneratorOutput
#1498
opened Apr 11, 2026 by
nithinvc
Contributor
[skyrl][tinker] Use VLLMRenderer in SkyRL train backend
#1496
opened Apr 10, 2026 by
nithinvc
Contributor
[train][multimodal][1/3] Add vision support to generate() in new inference stack
#1494
opened Apr 10, 2026 by
nithinvc
Contributor
3 tasks done
[tinker] Fix single request batching in TinkerEngine
#1489
opened Apr 10, 2026 by
pcmoritz
Collaborator
[multimodal] add language_model_only flag for models like qwen3.5
#1487
opened Apr 9, 2026 by
erictang000
Collaborator
[train][multimodal][2/3] Add multi-turn VLM generator
#1486
opened Apr 9, 2026 by
nithinvc
Contributor
2 tasks done
[skyrl][tinker] Multi-modal Tinker Sampling
#1484
opened Apr 9, 2026 by
nithinvc
Contributor
3 tasks done
[fix][train] Prompt-based mini-batching for step-wise training
#1483
opened Apr 9, 2026 by
CharlieFRuan
Member
3 tasks done
Add prefix-aware merging for step-wise training
#1479
opened Apr 8, 2026 by
CharlieFRuan
Member
3 tasks done
feat: add max_tokens_per_microbatch config for token-based micro-batching
#1477
opened Apr 8, 2026 by
erictang000
Collaborator
feat: native Atropos-SHM integration and modular ingestion layer
#1473
opened Apr 7, 2026 by
RUFFY-369
[train] Enable expandable_segments to reduce GPU memory fragmentation
run_train_gpu_ci
#1470
opened Apr 7, 2026 by
CharlieFRuan
Member
Draft
5 tasks done
[tinker] Support prompt_logprobs in SkyRLTrainBackend sample() path
#1461
opened Apr 6, 2026 by
pbokc
Contributor
[tinker] Support KL loss in SkyRLTrainBackend
#1460
opened Apr 5, 2026 by
pbokc
Contributor
feat: LLM-synthesized hints for failed trajectories
#1456
opened Apr 4, 2026 by
dzorlu
4 tasks
[skyrl-train] feat: add native GMPO policy loss with validation and tests
#1449
opened Apr 2, 2026 by
taivu1998
Fix event-loop blocking in one-step-off async save/export paths
#1446
opened Apr 2, 2026 by
taivu1998
Change default KL estimator from k3 to k2 for loss-based KL
#1445
opened Apr 2, 2026 by
taivu1998
[skyrl-train] Add trainer-side max_response_length for Dr. GRPO normalization and DAPO overlong handling
#1440
opened Apr 2, 2026 by
taivu1998
[WIP][tx] Add initial implementation of RayJaxBackend
#1418
opened Mar 31, 2026 by
andrewsykim
Contributor
Draft