Pull requests: ggml-org/llama.cpp
Pull requests list
Fix page-alignment issue in ggml_metal_get_tensor_async
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#18738
opened Jan 10, 2026 by
amakropoulos
test-backend-ops: fix mxfp4 tests on blackwell
testing
Everything test related
#18736
opened Jan 10, 2026 by
am17an
feat: add support for WeDLM architecture
python
python script changes
#18731
opened Jan 10, 2026 by
feedseawave
5 tasks done
lookup, lookahead: fix crash when n_ctx not specified
examples
#18729
opened Jan 10, 2026 by
pestopoppa
preset: allow named remote preset
documentation
Improvements or additions to documentation
#18728
opened Jan 9, 2026 by
ngxson
kv-cache: optimize SWA slot reuse with forward-looking masking
#18727
opened Jan 9, 2026 by
pestopoppa
opencl: add softplus op
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#18726
opened Jan 9, 2026 by
shaofeiqi
llama: fix pooled embedding readback sizing/stride and state I/O
#18723
opened Jan 9, 2026 by
retr0reg
model: Add VAETKI support
examples
model
Model specific
python
python script changes
#18719
opened Jan 9, 2026 by
dororodoroddo
5 tasks done
server : adjust unified KV cache tests
examples
python
python script changes
server
#18716
opened Jan 9, 2026 by
ggerganov
Support parsing JSON into grammar for schemas with no type and no properties
#18711
opened Jan 9, 2026 by
markrietveld
Draft
vulkan: Check maxStorageBufferRange in supports_op
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18709
opened Jan 9, 2026 by
jeffbolznv
ggml-metal: Clean up files used for embedded build
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#18705
opened Jan 9, 2026 by
DaAwesomeP
[WIP] ggml-opencl: op args init refactoring
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
Improving inference speed for the repack buffer type on NUMA architectures
ggml
changes relating to the ggml tensor library for machine learning
#18698
opened Jan 8, 2026 by
zzjianhui
debug : include LLAMA_POOLING_TYPE_UNSPECIFIED in pooling check
examples
#18692
opened Jan 8, 2026 by
danbev
ggml-cuda: extend concat support for more types
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18690
opened Jan 8, 2026 by
Lourdle
vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18678
opened Jan 7, 2026 by
jeffbolznv
HIP: adjust RDNA3.5 MMQ kernel selection logic
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18666
opened Jan 7, 2026 by
JohannesGaessler