#
ik-llama-cpp
Here are 4 public repositories matching this topic...
Reproducible Gemma 4 multi-token-prediction bench harness for ik_llama.cpp. PR #1744 merged 2026-05-10. 2.6-2.98x lossless speedup verified. Two scripts to clone, build, bench.
-
Updated
May 16, 2026 - Shell
Turboquant Q4/Q3 with IQK FA
-
Updated
Apr 19, 2026 - C++
Turboquant Q4/Q3 with IQK FA
-
Updated
Apr 19, 2026 - Python
Improve this page
Add a description, image, and links to the ik-llama-cpp topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the ik-llama-cpp topic, visit your repo's landing page and select "manage topics."