Skip to content

[executorch][gemma4] fuse MLP gate/up at GGUF load #20481

Draft
Gasoonjia wants to merge 7 commits into
gemma4_31b-cuda-decode-speedupfrom
gemma4_31b-mlp-fusion-unified
Draft

[executorch][gemma4] fuse MLP gate/up at GGUF load #20481
Gasoonjia wants to merge 7 commits into
gemma4_31b-cuda-decode-speedupfrom
gemma4_31b-mlp-fusion-unified

[executorch][gemma4] fuse MLP gate/up at GGUF load (single point, cud…

638f07a
Select commit
Loading
Failed to load commit list.

Select a check to view from the sidebar