Pull requests: NVIDIA-NeMo/RL
fix(security): bump ray, mlflow, urllib3 and nemo-gym for CVE remediation
#2560
opened May 22, 2026 by
kajalj22
Contributor
3 tasks
feat(grpo): add sequence-level logprob error metrics
CI:L0
Run doctests and unit tests
super-v3
#2559
opened May 22, 2026 by
macandro96
4 tasks
beep boop 🤖: Bumping NeMo-RL to v0.6.1
CI
Relating to CI
#2549
opened May 22, 2026 by
nemo-automation-bot
Bot
beep boop 🤖: Bumping NeMo-RL to v0.6.1
#2548
opened May 22, 2026 by
nemo-automation-bot
Bot
beep boop 🤖: Bumping NeMo-RL to v0.6.1
#2547
opened May 22, 2026 by
nemo-automation-bot
Bot
ci: pin FW-CI-templates to NVIDIA-NeMo/FW-CI-templates#480
CI:docs
Run doctest
CI
Relating to CI
#2546
opened May 22, 2026 by
ko3n1g
Contributor
feat: make only_unmask_final configurable in SFT
#2543
opened May 22, 2026 by
ashors1
Contributor
4 tasks
[feat:] Add CISPO loss
community-request
Documentation
Improvements or additions to documentation
waiting-on-maintainers
Waiting on maintainers to respond
#2531
opened May 19, 2026 by
pengdurice
Contributor
1 of 4 tasks
feat: PPO with MCore
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2530
opened May 19, 2026 by
bg51717
Contributor
4 tasks done
feat: add AsyncNemoGymRolloutManager for gym per-prompt rollouts
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2528
opened May 19, 2026 by
yuki-97
Contributor
1 task done
refactor(distillation): migrate DistillationConfig, DistillationSaveS…
CI:L1
Run doctests, unit tests, and functional tests
#2527
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(rm): migrate RMConfig, RMSaveState, RMValMetrics to BaseModel
CI:L1
Run doctests, unit tests, and functional tests
#2526
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(sft): migrate SFTConfig, SFTSaveState to BaseModel
CI:L1
Run doctests, unit tests, and functional tests
#2525
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(dpo): migrate DPOConfig, DPOSaveState, DPOValMetrics to Base…
CI:L1
Run doctests, unit tests, and functional tests
#2524
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(loss): migrate DPOLossConfig, DistillationLossConfig, DraftC…
CI:L1
Run doctests, unit tests, and functional tests
#2520
opened May 18, 2026 by
NolenLiang
Contributor
4 tasks
refactor(grpo): migrate TypedDict configs to pydantic BaseModel
CI:L1
Run doctests, unit tests, and functional tests
#2518
opened May 18, 2026 by
NolenLiang
Contributor
4 tasks
feat: fix the vLLM DP path
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#2517
opened May 18, 2026 by
guyueh1
Contributor
4 tasks
feat(sft): make only_unmask_final configurable in SFTConfig
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2516
opened May 17, 2026 by
yuki-97
Contributor
1 task done
fix: fix preserving dataset merge
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2515
opened May 17, 2026 by
yuki-97
Contributor
1 task done