Skip to content

[gemma4_31b][cuda] length-aware bf16 global attention + head_dim-agno…#20506

Draft
Gasoonjia wants to merge 1 commit into
gemma4_31b-cuda-decode-speedupfrom
gemma4_31b-cuda-attn-perf-git
Draft

[gemma4_31b][cuda] length-aware bf16 global attention + head_dim-agno…#20506
Gasoonjia wants to merge 1 commit into
gemma4_31b-cuda-decode-speedupfrom
gemma4_31b-cuda-attn-perf-git

Commits

Commits on Jun 25, 2026