Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3

Issues list

ggml : refactor ggml-cpu.c into multiple C++ source files refactoring Refactoring roadmap Part of a roadmap project
#10180 opened Nov 5, 2024 by ggerganov
imatrix : use GGUF to store importance matrices breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. enhancement New feature or request examples python python script changes refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#9400 opened Sep 10, 2024 by compilade Draft
3 of 8 tasks
Refactor: Add more typechecking to GGUFWriter.add_key_value help wanted Extra attention is needed refactoring Refactoring
#9095 opened Aug 19, 2024 by mofosyne
Refactor: Existing examples refactoring opportunities help wanted Extra attention is needed refactoring Refactoring
#7559 opened May 27, 2024 by mofosyne
3 tasks
llama : support Jamba hybrid Transformer-Mamba models android Issues specific to Android embeddings embedding related topics enhancement New feature or request examples ggml changes relating to the ggml tensor library for machine learning model Model specific need feedback Testing and feedback with results are needed python python script changes refactoring Refactoring Review Complexity : High Generally require indepth knowledge of LLMs or GPUs server
#7531 opened May 25, 2024 by compilade Draft
7 of 17 tasks
common, ngram_cache: added const reference for std::pair<> and std::tuple<> more 16 bytes: refactoring Refactoring Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7270 opened May 14, 2024 by GermanAizek
ggml, ngram-cache, log: added const and const ref for function params refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#7269 opened May 14, 2024 by GermanAizek
tokenization: no double BOS tokens refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#7107 opened May 6, 2024 by JohannesGaessler
ggml : unified CMake build build Compilation issues enhancement New feature or request refactoring Refactoring roadmap Part of a roadmap project
#6913 opened Apr 25, 2024 by ggerganov
Refactor chat template API demo Demonstrate some concept or idea, not intended to be merged refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#6822 opened Apr 22, 2024 by ngxson Draft
llama : switch to floating-point token positions demo Demonstrate some concept or idea, not intended to be merged refactoring Refactoring Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#5679 opened Feb 23, 2024 by ggerganov Draft
P-Step Truncation Sampling generation quality Quality of model output need feedback Testing and feedback with results are needed refactoring Refactoring Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#5675 opened Feb 23, 2024 by p-e-w
Fuse matrix multiplication + SiLU performance Speed related topics refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#5413 opened Feb 8, 2024 by JohannesGaessler Draft
llama : refactor the llm.build_xxx functions good first issue Good for newcomers refactoring Refactoring roadmap Part of a roadmap project
#5239 opened Jan 31, 2024 by ggerganov
llama : create llamax library refactoring Refactoring roadmap Part of a roadmap project
#5215 opened Jan 30, 2024 by ggerganov
llama : integer type consistency in llama.h enhancement New feature or request good first issue Good for newcomers refactoring Refactoring roadmap Part of a roadmap project
#4574 opened Dec 21, 2023 by MarcusDunn
llama : speed-up grammar sampling performance Speed related topics refactoring Refactoring roadmap Part of a roadmap project
#4218 opened Nov 25, 2023 by ggerganov
server : improvements and maintenance help wanted Extra attention is needed refactoring Refactoring roadmap Part of a roadmap project server/webui
#4216 opened Nov 25, 2023 by ggerganov
6 of 10 tasks
Avoid unused constant warnings refactoring Refactoring Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#2029 opened Jun 28, 2023 by set-soft