[executorch][gemma4] fuse MLP gate/up at GGUF load #20481

Draft
Gasoonjia wants to merge 7 commits into gemma4_31b-cuda-decode-speedup from gemma4_31b-mlp-fusion-unified

Commits

Commits on Jun 23, 2026

Commits on Jun 24, 2026