-
Notifications
You must be signed in to change notification settings - Fork 12k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ggml : refactor ggml-cpu.c into multiple C++ source files
refactoring
Refactoring
roadmap
Part of a roadmap project
#10180
opened Nov 5, 2024 by
ggerganov
imatrix : use GGUF to store importance matrices
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
enhancement
New feature or request
examples
python
python script changes
refactoring
Refactoring
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
Refactor: Add more typechecking to GGUFWriter.add_key_value
help wanted
Extra attention is needed
refactoring
Refactoring
#9095
opened Aug 19, 2024 by
mofosyne
Refactor: Existing examples refactoring opportunities
help wanted
Extra attention is needed
refactoring
Refactoring
#7559
opened May 27, 2024 by
mofosyne
3 tasks
llama : support Jamba hybrid Transformer-Mamba models
android
Issues specific to Android
embeddings
embedding related topics
enhancement
New feature or request
examples
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
need feedback
Testing and feedback with results are needed
python
python script changes
refactoring
Refactoring
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
server
common, ngram_cache: added const reference for std::pair<> and std::tuple<> more 16 bytes:
refactoring
Refactoring
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7270
opened May 14, 2024 by
GermanAizek
Loading…
ggml, ngram-cache, log: added const and const ref for function params
refactoring
Refactoring
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7269
opened May 14, 2024 by
GermanAizek
Loading…
tokenization: no double BOS tokens
refactoring
Refactoring
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7107
opened May 6, 2024 by
JohannesGaessler
Loading…
ggml : unified CMake build
build
Compilation issues
enhancement
New feature or request
refactoring
Refactoring
roadmap
Part of a roadmap project
#6913
opened Apr 25, 2024 by
ggerganov
Refactor chat template API
demo
Demonstrate some concept or idea, not intended to be merged
refactoring
Refactoring
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
llama : switch to floating-point token positions
demo
Demonstrate some concept or idea, not intended to be merged
refactoring
Refactoring
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
P-Step Truncation Sampling
generation quality
Quality of model output
need feedback
Testing and feedback with results are needed
refactoring
Refactoring
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
#5675
opened Feb 23, 2024 by
p-e-w
Loading…
Fuse matrix multiplication + SiLU
performance
Speed related topics
refactoring
Refactoring
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#5413
opened Feb 8, 2024 by
JohannesGaessler
•
Draft
llama : refactor the llm.build_xxx functions
good first issue
Good for newcomers
refactoring
Refactoring
roadmap
Part of a roadmap project
#5239
opened Jan 31, 2024 by
ggerganov
llama : create llamax library
refactoring
Refactoring
roadmap
Part of a roadmap project
#5215
opened Jan 30, 2024 by
ggerganov
llama : integer type consistency in New feature or request
good first issue
Good for newcomers
refactoring
Refactoring
roadmap
Part of a roadmap project
llama.h
enhancement
#4574
opened Dec 21, 2023 by
MarcusDunn
llama : speed-up grammar sampling
performance
Speed related topics
refactoring
Refactoring
roadmap
Part of a roadmap project
#4218
opened Nov 25, 2023 by
ggerganov
server : improvements and maintenance
help wanted
Extra attention is needed
refactoring
Refactoring
roadmap
Part of a roadmap project
server/webui
#4216
opened Nov 25, 2023 by
ggerganov
6 of 10 tasks
Avoid unused constant warnings
refactoring
Refactoring
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#2029
opened Jun 28, 2023 by
set-soft
Loading…
ProTip!
Mix and match filters to narrow down what you’re looking for.