-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
remove templates from soft_max_f32_submitter to allow SYCL graph updates
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13724
opened May 23, 2025 by
lslusarczyk
Loading…
ggml : riscv: add xtheadvector support
ggml
changes relating to the ggml tensor library for machine learning
#13720
opened May 23, 2025 by
xctan
Loading…
ggml : add ggml_gelu_erf() CUDA kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13719
opened May 23, 2025 by
ngxson
Loading…
Replace alert and confirm with custom modals.
examples
server
#13711
opened May 22, 2025 by
igardev
Loading…
common/llama: align structures for reduce cacheline size on 64bit platforms
examples
server
#13710
opened May 22, 2025 by
GermanAizek
Loading…
add GGML_USE_NUMA_MIGRATE feature to optimize cross NUMA op computation
examples
ggml
changes relating to the ggml tensor library for machine learning
#13649
opened May 20, 2025 by
wenlujon
Loading…
MLA kv cache: fix split graph backend assignment when kv cache store on CPU
#13648
opened May 20, 2025 by
xiang1guo
Loading…
webui: Allow editing file attachments when editing messages.
examples
server
#13645
opened May 20, 2025 by
nauful
Loading…
sycl: add find_package call for OpenCL
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13643
opened May 19, 2025 by
AD2605
Loading…
sycl: Add more debug prints
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13640
opened May 19, 2025 by
Rbiessy
Loading…
[CANN]: add the basic supports of Flash Attention kernel
Ascend NPU
issues specific to Ascend NPUs
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#13627
opened May 19, 2025 by
shibizhao
Loading…
cuda: fix CMAKE_CUDA_COMPILER not found error (#13528)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13625
opened May 19, 2025 by
lizhenneng
Loading…
scripts: update pyproject.toml - deprecated poetry config + support uv
#13615
opened May 18, 2025 by
borgoat
Loading…
SYCL: Add non contiguous support in RMS_NORM and NORM kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13611
opened May 18, 2025 by
qnixsynapse
Loading…
ggml: aarch64: Implement SVE F32 kernels for Mamba Model
ggml
changes relating to the ggml tensor library for machine learning
#13602
opened May 17, 2025 by
vineelabhinav
Loading…
ggml : add memset_tensor for rpc
ggml
changes relating to the ggml tensor library for machine learning
#13601
opened May 17, 2025 by
gkpln3
Loading…
ggml : fix race-condition in ggml-rpc
ggml
changes relating to the ggml tensor library for machine learning
#13600
opened May 17, 2025 by
gkpln3
Loading…
server : separate the notion of position and KV tokens, remove prompt truncation
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
server
#13576
opened May 15, 2025 by
ngxson
Loading…
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method
python
python script changes
#13561
opened May 15, 2025 by
CISC
Loading…
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#13550
opened May 14, 2025 by
gabe-l-hart
•
Draft
2 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.