Issues: ggml-org/llama.cpp
Issues list
sycl: Add reorder to Q6_K mmvq implementation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13885
opened May 29, 2025 by
s-Nick
sycl: quantize and reorder the input to q8_1 when reorder is enabled
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13826
opened May 27, 2025 by
AD2605
SYCL: Implement few same quantized type copy kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13739
opened May 24, 2025 by
qnixsynapse
Draft
remove templates from soft_max_f32_submitter to allow SYCL graph updates
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13724
opened May 23, 2025 by
lslusarczyk
llama : try loading tensors with pre-computed hashes
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#13106
opened Apr 25, 2025 by
rgerganov
tool-call: Phi-4 support
android
#12288
opened Mar 9, 2025 by
jpohhhh
ggml: move kvalues_iq4nl definition to ggml-common.h
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11785
opened Feb 10, 2025 by
HungMingWu
Clean up Test Script + Update it to work on Instruct Tuned Models
examples
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11610
opened Feb 3, 2025 by
Mr-Thack
[SYCL] pass SYCL CI
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#10041
opened Oct 25, 2024 by
airMeng
2 of 4 tasks
add print cpu info
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#9957
opened Oct 20, 2024 by
NeoZhangJianyu
2 of 4 tasks
[Draft] Tensor Parallel support to llama.cpp
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#9648
opened Sep 26, 2024 by
ClarkChin08
1 of 3 tasks
Revert "ggml : remove OpenCL (#7735) + (#8235)"
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
script
Script related
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language