Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

Granite Four Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning python python script changes server testing Everything test related
#13550 opened May 14, 2025 by gabe-l-hart Draft
2 tasks
llama : try loading tensors with pre-computed hashes Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#13106 opened Apr 25, 2025 by rgerganov Loading…
Metal TQ2_0 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#12485 opened Mar 20, 2025 by dmahurin Loading…
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
Attempt to add the mllama support Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#11639 opened Feb 4, 2025 by q82419 Draft
3 of 5 tasks
ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU Apple Metal https://en.wikipedia.org/wiki/Metal_(API) enhancement New feature or request ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs performance Speed related topics python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs testing Everything test related
#11183 opened Jan 10, 2025 by compilade Loading…
Add VisionOS compatibility by adding missing type definitions Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#11019 opened Dec 30, 2024 by sinkingsugar Loading…
Bamba architecture Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#10810 opened Dec 12, 2024 by gabe-l-hart Draft
3 tasks
naming : normalize the name of callback-related identifiers Apple Metal https://en.wikipedia.org/wiki/Metal_(API) breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#9405 opened Sep 10, 2024 by ggerganov Loading…
llama : initial Mamba-2 support Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level testing Everything test related
#9126 opened Aug 21, 2024 by compilade Loading…
8 of 9 tasks
Revert "ggml : remove OpenCL (#7735) + (#8235)" Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment python python script changes script Script related SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#8986 opened Aug 11, 2024 by okias Draft
2 of 4 tasks
Added support to select GPU using metal on Apple Intel or Apple Silicon using --main-gpu index Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8962 opened Aug 10, 2024 by ifeanyipossibilities Loading…
2 of 4 tasks
Rebalancing Metal threads workload in dot product kernel kernel_mul_mv_f16_f32_l4 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#7522 opened May 24, 2024 by izard Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.