Issues: ggerganov/llama.cpp
Bug: CANN: Inference result garbled
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10252
opened Nov 11, 2024 by
feichenchina
Bug: In interactive chat mode (LLaMa 3.1 70B) sometimes llama.cpp fills in the user's side of the conversation.
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10249
opened Nov 10, 2024 by
vfhbg
web UI : support syntax highlighting
enhancement
New feature or request
good first issue
Good for newcomers
help wanted
Extra attention is needed
#10246
opened Nov 10, 2024 by
slaren
Feature Request: Flash Attention 3
enhancement
New feature or request
#10245
opened Nov 10, 2024 by
hg0428
4 tasks done
Bug: missing tensor blk.0.ffn_down_exps.weight when loading mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10244
opened Nov 10, 2024 by
hnfong
fatal error: 'hip/hip_fp16.h' file not found when building using CMake and ROCm 6.2
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10236
opened Nov 9, 2024 by
lubosz
Bug: server GET /props request return json with chat_template with last char replaced by \x00
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10235
opened Nov 9, 2024 by
kks-imt
Bug: CUBLAS_STATUS_INTERNAL_ERROR when using --gpu-layers on ROCm 6.2
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10234
opened Nov 9, 2024 by
lubosz
Bug: Server Slows Down Significantly Over Time, Requires Frequent Reboots (RX 7900 XT)
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10227
opened Nov 9, 2024 by
tigert2173
Bug: image encoding error with malloc memory
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10225
opened Nov 9, 2024 by
dingtine
bge-multilingual-gemma2:ERROR:hf-to-gguf:Model Gemma2Model is not supported
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10215
opened Nov 8, 2024 by
hellozjj
Bug: not support langchain v0.3 to use tools
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10214
opened Nov 8, 2024 by
lee249876293
Feature Request: Support Airllm
enhancement
New feature or request
#10202
opened Nov 7, 2024 by
kbocock-krg
4 tasks done
Bug: DLLAMA_VULKAN=1 tag is not linking vulkan
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10201
opened Nov 7, 2024 by
andrewson97
Bug: Nondeterministic results on AMD RDNA3 (ROCm) despite zero temperature and fixed seed
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10197
opened Nov 6, 2024 by
Googulator
Bug: SYCL crash
bug-unconfirmed
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#10184
opened Nov 5, 2024 by
0xDEADFED5
ggml : move LLAMAFILE/tinyBLAS into a backend
refactoring
Refactoring
#10183
opened Nov 5, 2024 by
ggerganov
ggml : refactor ggml-cpu.c into multiple C++ source files
refactoring
Refactoring
#10180
opened Nov 5, 2024 by
ggerganov
Feature Request: Support BitNet.cpp quantization format
enhancement
New feature or request
#10179
opened Nov 5, 2024 by
luionTW
Bug: Failed to convert OuteAI/OuteTTS-0.1-350M
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10178
opened Nov 5, 2024 by
apepkuss
Bug: Speculative Decoding "Segmentation fault (core dumped)"
bug
Something isn't working
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10176
opened Nov 4, 2024 by
AbdullahMPrograms
tts : add basic example for text-to-speech
good first issue
Good for newcomers
tts
Text-to-speech
#10173
opened Nov 4, 2024 by
ggerganov
Bug: CANN E89999
Ascend NPU
issues specific to Ascend NPUs
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10161
opened Nov 4, 2024 by
ninth99
Feature Request: [CANN] backend supports Ascend 310P
Ascend NPU
issues specific to Ascend NPUs
enhancement
New feature or request
#10160
opened Nov 4, 2024 by
leo-pony
4 tasks done