-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support GEMM + Swiglu fused MLP
complexity: high
#3971
opened Mar 20, 2026 by
ksivaman
Loading…
5 tasks
Enable cpu_offloading with Full iteration CUDA graph
Final Review
PR is in the "final review" stage
#3969
opened Mar 20, 2026 by
vasunvidia
Loading…
5 tasks
Fix
mtp_use_repeated_layer behavior for GPT models
community-request
complexity: low
#3965
opened Mar 20, 2026 by
rkarimimahab
Loading…
5 tasks
Fix: qk-clip causes a tensor shape error when tensor-model-parallel-size > 1
community-request
Final Review
PR is in the "final review" stage
#3963
opened Mar 20, 2026 by
xzy-xzy
Loading…
5 tasks
Small quality-of-life improvements in All necessary approvals have been made
complexity: low
megatron/training
Approved
Add moe loss normalization for RL SFT
complexity: low
#3956
opened Mar 19, 2026 by
pthombre
Loading…
5 tasks
fix: use dump file prefix for NCCL flight recorder temp files
complexity: low
#3955
opened Mar 19, 2026 by
sbak5
Loading…
5 tasks
Fix completions endpoint
Approved
All necessary approvals have been made
complexity: low
Run functional tests
Fix key error in layer_wise_optimizer.sharded_state_dict
community-request
Final Review
PR is in the "final review" stage
#3939
opened Mar 19, 2026 by
chenchun
Loading…
1 of 5 tasks
Fix FSDP checkpoint conversion and loading for Qwen3.5-VL
community-request
Final Review
PR is in the "final review" stage
#3936
opened Mar 19, 2026 by
DAISY-gh
Loading…
5 tasks
Make text generation server hostname configurable
complexity: low
Final Review
PR is in the "final review" stage
#3935
opened Mar 18, 2026 by
santhnm2
Loading…
5 tasks
Improve load balancing behavior for prefix cache-aware routing
complexity: low
Final Review
PR is in the "final review" stage
Add --muon-coefficient-type argument for Muon optimizer
complexity: low
Final Review
PR is in the "final review" stage
#3927
opened Mar 18, 2026 by
mchrzanowski
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-02-20.