Pull requests: NVIDIA-NeMo/RL

Pull requests list

feat: Use Megatron-Bridge recipes for megatron_cfg.
#2096 opened Mar 11, 2026 by sfawzy-nv
4 tasks
feat: nemo gym vlm support
Labels: CI:Lfast (runs a fast test suite and reuses the nightly `main` container, but syncs dependencies to the PR's version)
#2092 opened Mar 9, 2026 by cmunley1
4 tasks
Fix grammar and typos in README
#2091 opened Mar 9, 2026 by terrykong
1 task
Add Eagle3 online speculative decoding support
#2078 opened Mar 6, 2026 by isomap
4 tasks
fix: add Qwen3.5 related changes
#2076 opened Mar 6, 2026 by zpqiu
2 of 9 tasks
tests: add megatron bump suite
Labels: CI:docs (Run doctest)
#2068 opened Mar 5, 2026 by terrykong
4 tasks
ci: Temp disable megatron lora grpo tests
Labels: CI:docs
#2062 opened Mar 4, 2026 by chtruong814
4 tasks
Vllm grpo experiments
Labels: documentation (Improvements or additions to documentation)
#2059 opened Mar 4, 2026 by shaunjoshi
4 tasks
feat: Add YaRN rope scaling support on Megatron-Bridge
Labels: CI:Lfast, documentation
#2052 opened Mar 3, 2026 by RayenTian Draft
4 tasks
docs: add prerequisites, troubleshooting, and build verification for GRPO quickstart
Labels: community-request, documentation
#2051 opened Mar 3, 2026 by brluobt
3 tasks
chore: bumpup Megatron-Bridge submodule to main
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests), Run CICD
#2039 opened Mar 1, 2026 by ZhiyuLi-Nvidia
4 tasks
fp8 refit opt
Labels: Performance (Related to improving performance)
#2037 opened Feb 28, 2026 by Jianbing-D Draft
4 tasks
fix: address deprecation warning for using a non-tuple sequence for multidimensional indexing
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2032 opened Feb 27, 2026 by ananthsub
1 of 4 tasks
feat: basic ppo training implementation
#2027 opened Feb 26, 2026 by hXl3s Draft
4 tasks
feat: Dynamo router support
#2023 opened Feb 25, 2026 by jthomson04 Draft
4 tasks