-
Notifications
You must be signed in to change notification settings - Fork 12k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
server
testing
Everything test related
#13550
opened May 14, 2025 by
gabe-l-hart
•
Draft
2 tasks
llama : try loading tensors with pre-computed hashes
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#13106
opened Apr 25, 2025 by
rgerganov
Loading…
Metal TQ2_0
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12485
opened Mar 20, 2025 by
dmahurin
Loading…
tool-call
: Phi-4 support
android
#12288
opened Mar 9, 2025 by
jpohhhh
Loading…
Attempt to add the https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
mllama
support
Apple Metal
ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
enhancement
New feature or request
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
performance
Speed related topics
python
python script changes
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
testing
Everything test related
#11183
opened Jan 10, 2025 by
compilade
Loading…
Add VisionOS compatibility by adding missing type definitions
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#11019
opened Dec 30, 2024 by
sinkingsugar
Loading…
Bamba architecture
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#10810
opened Dec 12, 2024 by
gabe-l-hart
•
Draft
3 tasks
naming : normalize the name of callback-related identifiers
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#9405
opened Sep 10, 2024 by
ggerganov
Loading…
llama : initial Mamba-2 support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
testing
Everything test related
#9126
opened Aug 21, 2024 by
compilade
Loading…
8 of 9 tasks
Revert "ggml : remove OpenCL (#7735) + (#8235)"
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
script
Script related
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Added support to select GPU using metal on Apple Intel or Apple Silicon using --main-gpu index
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8962
opened Aug 10, 2024 by
ifeanyipossibilities
Loading…
2 of 4 tasks
Rebalancing Metal threads workload in dot product kernel kernel_mul_mv_f16_f32_l4
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7522
opened May 24, 2024 by
izard
Loading…
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.