Skip to content

Add NVFP4 per-token quantization recipe#3045

Draft
cael-ling wants to merge 14 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-per-token-recipe
Draft

Add NVFP4 per-token quantization recipe#3045
cael-ling wants to merge 14 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-per-token-recipe

Commits

Commits on May 27, 2026

Commits on May 28, 2026

Commits on May 29, 2026

Commits on May 30, 2026

Commits on May 31, 2026

Commits on Jun 2, 2026