Issues: ggerganov/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 1
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 7

Issues list

Bug: CANN: Inference result garbled (labels: bug-unconfirmed, high severity)
#10252 opened Nov 11, 2024 by feichenchina
Bug: -nb cannot handle large number
#10251 opened Nov 11, 2024 by FNsi
Bug: In interactive chat mode (LLaMa 3.1 70B) sometimes llama.cpp fills in the user's side of the conversation. (labels: bug-unconfirmed, high severity)
#10249 opened Nov 10, 2024 by vfhbg
web UI : support syntax highlighting (labels: enhancement, good first issue, help wanted)
#10246 opened Nov 10, 2024 by slaren
Feature Request: Flash Attention 3 (labels: enhancement)
#10245 opened Nov 10, 2024 by hg0428
4 tasks done
Bug: missing tensor blk.0.ffn_down_exps.weight when loading mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf (labels: bug-unconfirmed, high severity)
#10244 opened Nov 10, 2024 by hnfong
fatal error: 'hip/hip_fp16.h' file not found when building using CMake and ROCm 6.2 (labels: bug-unconfirmed, medium severity)
#10236 opened Nov 9, 2024 by lubosz
Bug: server GET /props request return json with chat_template with last char replaced by \x00 (labels: bug-unconfirmed, high severity)
#10235 opened Nov 9, 2024 by kks-imt
Bug: CUBLAS_STATUS_INTERNAL_ERROR when using --gpu-layers on ROCm 6.2 (labels: bug-unconfirmed, high severity)
#10234 opened Nov 9, 2024 by lubosz
Bug: Server Slows Down Significantly Over Time, Requires Frequent Reboots (RX 7900 XT) (labels: bug-unconfirmed, high severity)
#10227 opened Nov 9, 2024 by tigert2173
Bug: image encoding error with malloc memory (labels: bug-unconfirmed, low severity)
#10225 opened Nov 9, 2024 by dingtine
bge-multilingual-gemma2:ERROR:hf-to-gguf:Model Gemma2Model is not supported (labels: bug-unconfirmed, low severity)
#10215 opened Nov 8, 2024 by hellozjj
Bug: not support langchain v0.3 to use tools (labels: bug-unconfirmed, high severity)
#10214 opened Nov 8, 2024 by lee249876293
Feature Request: Support Airllm (labels: enhancement)
#10202 opened Nov 7, 2024 by kbocock-krg
4 tasks done
Bug: DLLAMA_VULKAN=1 tag is not linking vulkan (labels: bug-unconfirmed, high severity)
#10201 opened Nov 7, 2024 by andrewson97
Bug: Nondeterministic results on AMD RDNA3 (ROCm) despite zero temperature and fixed seed (labels: bug-unconfirmed, medium severity)
#10197 opened Nov 6, 2024 by Googulator
Bug: SYCL crash (labels: bug-unconfirmed, critical severity)
#10184 opened Nov 5, 2024 by 0xDEADFED5
Feature Request: Support BitNet.cpp quantization format (labels: enhancement)
#10179 opened Nov 5, 2024 by luionTW
Bug: Failed to convert OuteAI/OuteTTS-0.1-350M (labels: bug-unconfirmed, medium severity)
#10178 opened Nov 5, 2024 by apepkuss
Bug: Speculative Decoding "Segmentation fault (core dumped)" (labels: bug, low severity)
#10176 opened Nov 4, 2024 by AbdullahMPrograms
tts : add basic example for text-to-speech (labels: good first issue, tts)
#10173 opened Nov 4, 2024 by ggerganov
Bug: CANN E89999 (labels: Ascend NPU, bug-unconfirmed, low severity)
#10161 opened Nov 4, 2024 by ninth99
Feature Request: [CANN] backend supports Ascend 310P (labels: Ascend NPU, enhancement)
#10160 opened Nov 4, 2024 by leo-pony
4 tasks done