Add test coverage for Muon muon_lr/adam_lr overrides by sowndappan5 · Pull Request #8047 · deepspeedai/DeepSpeed

sowndappan5 · 2026-06-04T03:05:35Z

Summary

Add coverage for separate learning rate overrides in the Muon optimizer path and fix the related Muon blog documentation.

Background

Muon parameters and non-Muon parameters are automatically split into separate optimizer groups. The intended behavior is:

muon_lr applies to Muon parameter groups
adam_lr applies to Adam parameter groups
lr remains the fallback for both groups when overrides are not provided

Changes

add a parameterized test covering:
- legacy lr fallback behavior
- separate muon_lr / adam_lr override behavior
fix the Muon blog table header to label muon_lr and adam_lr correctly

Validation

Ran:
python -m pytest DeepSpeed/tests/unit/ops/muon/test_muon_partial_training.py -k learning_rate_overrides -q -rs

Result:

test collected successfully
skipped locally because this distributed test requires 2 GPUs, while the local environment has 1 GPU

delock · 2026-06-05T04:04:51Z

 ### Evaluation Results

-| Optimizer | Learning Rate | adam_lr (for Muon) | MBPP  | MBPP+ | MMLU   | GSM8K  |
+| Optimizer | muon_lr | adam_lr | MBPP  | MBPP+ | MMLU   | GSM8K  |


It looks like change the title this way does not consistent with table content down below, i.e. AdamW learning rate is not muon learning rate. Does this file have to be modified?

sowndappan5 · 2026-06-05T04:59:17Z

I removed the README table change and updated the PR to keep it focused on the test coverage for the existing muon_lr / adam_lr behavior.

delock · 2026-06-05T05:13:42Z

Hi @sowndappan5 thanks for your new test cases. Can you address the comments in README.md and also fix the DCO tests by sign-off your commits? Thanks!

sowndappan5 · 2026-06-05T05:17:35Z

Thanks, I’ve addressed the README feedback and pushed the update. I’m fixing the DCO issue now by signing off the commits.

Signed-off-by: Sowndappan S <147894621+sowndappan5@users.noreply.github.com>

sowndappan5 · 2026-06-05T05:25:15Z

I’ve addressed the README feedback and force-pushed signed-off commits to fix the DCO issue. The PR is now updated and pending review/CI.

…date README contributors section Signed-off-by: Sowndappan S <147894621+sowndappan5@users.noreply.github.com>

sowndappan5 requested review from loadams, tjruwase and tohtana as code owners June 4, 2026 03:05

sowndappan5 mentioned this pull request Jun 4, 2026

[REQUEST] Muon Optimizer - Different LR for Different Groups #7657

Open

sfc-gh-truwase requested review from PKUWZP and delock and removed request for loadams, tjruwase and tohtana June 4, 2026 11:45

delock reviewed Jun 5, 2026

View reviewed changes

sowndappan5 added 2 commits June 5, 2026 05:21

Add pytest parameterized test for Muon and Adam learning rate overrides

e192979

Signed-off-by: Sowndappan S <147894621+sowndappan5@users.noreply.github.com>

Remove README table header change from Muon LR override PR

b7a3315

Signed-off-by: Sowndappan S <147894621+sowndappan5@users.noreply.github.com>

sowndappan5 force-pushed the master branch from 093973f to b7a3315 Compare June 5, 2026 05:21

delock approved these changes Jun 5, 2026

View reviewed changes

delock enabled auto-merge (squash) June 5, 2026 12:46

auto-merge was automatically disabled June 5, 2026 13:08
Head branch was pushed to by a user without write access

Refactor test cases for Muon optimizer learning rate overrides and up…

54d74a6

…date README contributors section Signed-off-by: Sowndappan S <147894621+sowndappan5@users.noreply.github.com>

sowndappan5 force-pushed the master branch from b0f9eb2 to 54d74a6 Compare June 5, 2026 13:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add test coverage for Muon muon_lr/adam_lr overrides#8047

Add test coverage for Muon muon_lr/adam_lr overrides#8047
sowndappan5 wants to merge 3 commits into
deepspeedai:masterfrom
sowndappan5:master

sowndappan5 commented Jun 4, 2026

Uh oh!

delock Jun 5, 2026 •

edited

Loading

Uh oh!

sowndappan5 commented Jun 5, 2026 •

edited

Loading

Uh oh!

delock commented Jun 5, 2026

Uh oh!

sowndappan5 commented Jun 5, 2026

Uh oh!

sowndappan5 commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sowndappan5 commented Jun 4, 2026

Summary

Background

Changes

Validation

Uh oh!

delock Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sowndappan5 commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

delock commented Jun 5, 2026

Uh oh!

sowndappan5 commented Jun 5, 2026

Uh oh!

sowndappan5 commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

delock Jun 5, 2026 •

edited

Loading

sowndappan5 commented Jun 5, 2026 •

edited

Loading