Pull requests: NVIDIA/Model-Optimizer
eval skill: parameterize external judge/user-sim endpoints via .env
#1591
opened Jun 1, 2026 by
cjluo-nv
Collaborator
Add Megatron-Bridge PTQ quantize + export example scripts
#1589
opened Jun 1, 2026 by
kevalmorabia97
Collaborator
Refactor local_hessian onto shared MSE flow + fused-MoE expert support
#1578
opened Jun 1, 2026 by
Fridah-nv
Contributor
feat: Layerwise calibration: nested config + QDQ-from-prev-layer flag + checkpoint I/O knobs
#1571
opened May 29, 2026 by
Fridah-nv
Contributor
[6078291][OMNIML-3716] Add ViT FP8/NVFP4 recipes + Torch-TRT example, wire softmax_quantizer in _QuantAttention
#1569
opened May 29, 2026 by
ajrasane
Contributor
[OMNIML-4788] specdec_bench/Qwen3.5-4B: throughput_32k benchmark + S3 upload step
#1564
opened May 28, 2026 by
ChenhanYu
Collaborator
Autoquant and GPTQ support in Megatron-Core [OMNIML-3151]
#1562
opened May 28, 2026 by
jenchen13
Contributor
[OMNIML-3994] Make sure all weight quantizers have _amax
#1560
opened May 28, 2026 by
sychen52
Contributor
[5924759] Fix fp16 ONNX INT8 entropy calibration on numpy >= 2.0
#1558
opened May 28, 2026 by
ajrasane
Contributor
WIP Support per expert amax in TEGroupedMLP
#1550
opened May 27, 2026 by
jenchen13
Contributor