-
Notifications
You must be signed in to change notification settings - Fork 12.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
CUDA: add roll
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14919
opened Jul 28, 2025 by
am17an
Loading…
repack : optimize mul_mat_id path
ggml
changes relating to the ggml tensor library for machine learning
#14918
opened Jul 28, 2025 by
ggerganov
Loading…
1 task
opencl: add ops docs
documentation
Improvements or additions to documentation
#14910
opened Jul 28, 2025 by
lhez
Loading…
opencl: fixed a typo
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14908
opened Jul 27, 2025 by
l29ah
Loading…
cuda : add softcap fusion
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14907
opened Jul 27, 2025 by
CISC
Loading…
ggml : repack block_iq4_nlx8 (AVX)
ggml
changes relating to the ggml tensor library for machine learning
#14904
opened Jul 27, 2025 by
ggerganov
Loading…
1 task
Vulkan: Fix minor debug mode issues
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14899
opened Jul 27, 2025 by
0cc4m
Loading…
GGML: Fix leak of backend buffer memory address in RPC
ggml
changes relating to the ggml tensor library for machine learning
#14882
opened Jul 26, 2025 by
struct
Loading…
model: add hunyuan dense
python
python script changes
#14878
opened Jul 25, 2025 by
stevenkuang-tencent
Loading…
Adding chat template support for Granite model
testing
Everything test related
#14864
opened Jul 24, 2025 by
smdesai
Loading…
test-backend-ops: enables perf/eval testing of composite ops
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#14833
opened Jul 23, 2025 by
etasnadi
Loading…
graph : reduce splits for recurrent and hybrid models
performance
Speed related topics
#14825
opened Jul 23, 2025 by
compilade
Loading…
feat(batched): Add functionality to upload benchmark test results
examples
#14811
opened Jul 22, 2025 by
MengAiDev
Loading…
convert : handle pre-quantized models
enhancement
New feature or request
python
python script changes
#14810
opened Jul 22, 2025 by
compilade
Loading…
2 tasks
opencl: tiled mul_mat with local memory for f16 and f32
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
Add LLaDA 8b Diffusion model
examples
python
python script changes
#14771
opened Jul 19, 2025 by
am17an
Loading…
docs : mention apt installation method
documentation
Improvements or additions to documentation
#14766
opened Jul 19, 2025 by
vp2177
Loading…
feat: Add extended sampling API with candidate token lists #14612
#14765
opened Jul 19, 2025 by
baonudesifeizhai
Loading…
webui: add missing messages in export (#13552)
examples
server
#14764
opened Jul 18, 2025 by
srogmann
Loading…
Fix MinicpmV model converter and clip to avoid using hardcode.
examples
python
python script changes
#14750
opened Jul 18, 2025 by
gryffindor-rr
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.