Skip to content

AMD - gpt-oss vllm mxfp4: AITER tuning + n-gram spec decode + server …#1657

Draft
nehaprakriya wants to merge 1 commit into
SemiAnalysisAI:mainfrom
nehaprakriya:gptoss-fp4-mi355x-aiter-specdec
Draft

AMD - gpt-oss vllm mxfp4: AITER tuning + n-gram spec decode + server …#1657
nehaprakriya wants to merge 1 commit into
SemiAnalysisAI:mainfrom
nehaprakriya:gptoss-fp4-mi355x-aiter-specdec