NovaSky-AI / SkyRL Public

Notifications You must be signed in to change notification settings
Fork 297
Star 1.8k

Code
Issues 179
Pull requests 125
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: NovaSky-AI/SkyRL

Labels 19 Milestones 0

New pull request New

125 Open 1,055 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix(docker): optimize Dockerfile.megatron to reduce image size by 1.21 GB run_train_megatron_gpu_ci

#1499 opened Apr 11, 2026 by dinhxuanvu

Loading…

[train][multimodal][3/3] Trainer changes to extract multi-modal outputs from GeneratorOutput

#1498 opened Apr 11, 2026 by nithinvc Contributor

Loading…

[skyrl][tinker] Use VLLMRenderer in SkyRL train backend

#1496 opened Apr 10, 2026 by nithinvc Contributor

Loading…

[train][multimodal][1/3] Add vision support to generate() in new inference stack

#1494 opened Apr 10, 2026 by nithinvc Contributor

Loading…

3 tasks done

[tinker] Fix single request batching in TinkerEngine

#1489 opened Apr 10, 2026 by pcmoritz Collaborator

Loading…

[multimodal] add language_model_only flag for models like qwen3.5

#1487 opened Apr 9, 2026 by erictang000 Collaborator

Loading…

[train][multimodal][2/3] Add multi-turn VLM generator

#1486 opened Apr 9, 2026 by nithinvc Contributor

Loading…

2 tasks done

[skyrl][tinker] Multi-modal Tinker Sampling

#1484 opened Apr 9, 2026 by nithinvc Contributor

Loading…

3 tasks done

[fix][train] Prompt-based mini-batching for step-wise training

#1483 opened Apr 9, 2026 by CharlieFRuan Member

Loading…

3 tasks done

Add ppo as alias for dual_clip policy loss type

#1481 opened Apr 8, 2026 by j316chuck • Draft

[feat][train] Enable new inference codepath by default

#1480 opened Apr 8, 2026 by SumanthRH Member • Draft

Add prefix-aware merging for step-wise training

#1479 opened Apr 8, 2026 by CharlieFRuan Member

Loading…

3 tasks done

feat: add max_tokens_per_microbatch config for token-based micro-batching

#1477 opened Apr 8, 2026 by erictang000 Collaborator

Loading…

[CI] Migrate GPU CI to run on new inference codepath run_train_gpu_ci

#1476 opened Apr 8, 2026 by SumanthRH Member • Draft

feat: native Atropos-SHM integration and modular ingestion layer

#1473 opened Apr 7, 2026 by RUFFY-369

Loading…

[train] Enable expandable_segments to reduce GPU memory fragmentation run_train_gpu_ci

#1470 opened Apr 7, 2026 by CharlieFRuan Member • Draft

5 tasks done

[tinker] Support prompt_logprobs in SkyRLTrainBackend sample() path

#1461 opened Apr 6, 2026 by pbokc Contributor

Loading…

[tinker] Support KL loss in SkyRLTrainBackend

#1460 opened Apr 5, 2026 by pbokc Contributor

Loading…

feat: LLM-synthesized hints for failed trajectories

#1456 opened Apr 4, 2026 by dzorlu

Loading…

4 tasks

[skyrl-train] feat: add native GMPO policy loss with validation and tests

#1449 opened Apr 2, 2026 by taivu1998

Loading…

Fix event-loop blocking in one-step-off async save/export paths

#1446 opened Apr 2, 2026 by taivu1998

Loading…

Change default KL estimator from k3 to k2 for loss-based KL

#1445 opened Apr 2, 2026 by taivu1998

Loading…

[skyrl-train] Flip grpo_norm_by_std default to false

#1443 opened Apr 2, 2026 by taivu1998

Loading…

[skyrl-train] Add trainer-side max_response_length for Dr. GRPO normalization and DAPO overlong handling

#1440 opened Apr 2, 2026 by taivu1998

Loading…

[WIP][tx] Add initial implementation of RayJaxBackend

#1418 opened Mar 31, 2026 by andrewsykim Contributor • Draft

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!