Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3

Issues list

sycl: Add reorder to Q6_K mmvq implementation
Labels: ggml, SYCL
#13885 opened May 29, 2025 by s-Nick

sycl: quantize and reorder the input to q8_1 when reorder is enabled
Labels: ggml, SYCL
#13826 opened May 27, 2025 by AD2605

SYCL: Implement few same quantized type copy kernels
Labels: ggml, SYCL
#13739 opened May 24, 2025 by qnixsynapse (Draft)

remove templates from soft_max_f32_submitter to allow SYCL graph updates
Labels: ggml, SYCL
#13724 opened May 23, 2025 by lslusarczyk

llama: Fix typos in multiple files
Labels: ggml, Kompute, python, SYCL
#13369 opened May 8, 2025 by co63oc

llama : try loading tensors with pre-computed hashes
Labels: Apple Metal, examples, ggml, Kompute, Nvidia GPU, SYCL, Vulkan
#13106 opened Apr 25, 2025 by rgerganov

tool-call: Phi-4 support
Labels: android, Apple Metal, devops, documentation, examples, ggml, Nvidia GPU, python, server, SYCL, testing, Vulkan
#12288 opened Mar 9, 2025 by jpohhhh

ggml: move kvalues_iq4nl definition to ggml-common.h
Labels: ggml, Nvidia GPU, SYCL
#11785 opened Feb 10, 2025 by HungMingWu

Clean up Test Script + Update it to work on Instruct Tuned Models
Labels: examples, SYCL
#11610 opened Feb 3, 2025 by Mr-Thack

[SYCL] pass SYCL CI
Labels: devops, ggml, SYCL, testing
#10041 opened Oct 25, 2024 by airMeng
2 of 4 tasks

add print cpu info
Labels: ggml, SYCL
#9957 opened Oct 20, 2024 by NeoZhangJianyu
2 of 4 tasks

[Draft] Tensor Parallel support to llama.cpp
Labels: ggml, SYCL
#9648 opened Sep 26, 2024 by ClarkChin08
1 of 3 tasks

Revert "ggml : remove OpenCL (#7735) + (#8235)"
Labels: Apple Metal, build, devops, documentation, examples, ggml, nix, python, script, SYCL
#8986 opened Aug 11, 2024 by okias (Draft)
2 of 4 tasks