-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18202
opened Dec 19, 2025 by
IMbackK
Loading…
ci : remove non-windows zip artifacts
devops
improvements to build systems and github actions
#18201
opened Dec 19, 2025 by
CISC
Loading…
llamafile: add rvv support for sgemm kernels
ggml
changes relating to the ggml tensor library for machine learning
#18199
opened Dec 19, 2025 by
taimur-10x
Loading…
arg: fix order to use short form before long form
examples
server
testing
Everything test related
#18196
opened Dec 19, 2025 by
ServeurpersoCom
Loading…
cmake: Added more x86_64 CPU backends when building with changes relating to the ggml tensor library for machine learning
GGML_CPU_ALL_VARIANTS=On
ggml
vulkan: fix im2col overflowing maxworkgroupcount
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18180
opened Dec 18, 2025 by
jeffbolznv
Loading…
vulkan: Warptile tuning for Intel Xe2/Xe3
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18178
opened Dec 18, 2025 by
virajwad
Loading…
tool/ex/tests: consistently free ctx, then model
examples
testing
Everything test related
#18168
opened Dec 18, 2025 by
JohannesGaessler
Loading…
Adding --direct-io flag for model loading
examples
#18166
opened Dec 18, 2025 by
JTischbein
Loading…
spm: make llama a dynamic library; leave placeholder for ggml/gguf na…
#18165
opened Dec 18, 2025 by
steven-moon
Loading…
ggml-hexagon: gelu optimization
ggml
changes relating to the ggml tensor library for machine learning
#18151
opened Dec 17, 2025 by
joeldushouyu
•
Draft
ggml-cpu: fix todo comment #15953 and SIMD-like calculate 4 elems
ggml
changes relating to the ggml tensor library for machine learning
#18150
opened Dec 17, 2025 by
GermanAizek
Loading…
[WIP] Enable cooperative matrix support for Intel Arrow Lake H GPUs
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
server: validate n_batch == n_ubatch for embeddings (#6263)
examples
server
#18123
opened Dec 17, 2025 by
yifant-code
•
Draft
[WIP] Cross Entropy Loss on Metal
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
ggml-hexagon: Add lightweight atomic synchronization support to htp_ops_context for inter-task coordination
ggml
changes relating to the ggml tensor library for machine learning
#18113
opened Dec 16, 2025 by
ngdxzy
Loading…
ggml-cuda: Delta-Net linear attention for Qwen3-Next
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#18102
opened Dec 16, 2025 by
hauhaut
Loading…
vulkan/cuda: fix topk_moe with exp_probs_b
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18071
opened Dec 15, 2025 by
jeffbolznv
Loading…
webui: add responsive chat width option to webui (#18067)
examples
server
#18068
opened Dec 15, 2025 by
ImadSaddik
Loading…
vulkan: support GGML_UNARY_OP_XIELU
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18062
opened Dec 15, 2025 by
jeffbolznv
Loading…
vulkan: in graph_optimize, try to group ADD operations
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18060
opened Dec 15, 2025 by
jeffbolznv
Loading…
webui: Client-side implementation of tool calling (with two tools)
examples
server
#18059
opened Dec 15, 2025 by
coder543
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.