-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
server webui easy config selection
demo
Demonstrate some concept or idea, not intended to be merged
examples
server
#12031
opened Feb 22, 2025 by
poulphunter
Loading…
added rudimentary support for outetts v0.3 500m and 1b models
demo
Demonstrate some concept or idea, not intended to be merged
examples
#11287
opened Jan 18, 2025 by
LostRuins
Loading…
Refactor/tinyblas
build
Compilation issues
demo
Demonstrate some concept or idea, not intended to be merged
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Quantize: specify each major tensor quant in CLI for common LLMs
demo
Demonstrate some concept or idea, not intended to be merged
examples
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
build example/main.cpp as shared library and intercept token printing using FFI
demo
Demonstrate some concept or idea, not intended to be merged
examples
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#8339
opened Jul 6, 2024 by
mtasic85
Loading…
Implemented Spellcheck for Llama.cpp
demo
Demonstrate some concept or idea, not intended to be merged
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7884
opened Jun 11, 2024 by
Ferruolo
Loading…
2 of 4 tasks
Direct I/O and Transparent HugePages
demo
Demonstrate some concept or idea, not intended to be merged
examples
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
script
Script related
server
#7420
opened May 20, 2024 by
pavelfatin
Loading…
Updated server_queue to delete tasks from queue when server is shutdown. Feature Request #6421
demo
Demonstrate some concept or idea, not intended to be merged
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#6941
opened Apr 27, 2024 by
rahsuri
Loading…
support MiniCPM-V-2
demo
Demonstrate some concept or idea, not intended to be merged
enhancement
New feature or request
examples
python
python script changes
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
#6919
opened Apr 26, 2024 by
Achazwl
Loading…
Refactor chat template API
demo
Demonstrate some concept or idea, not intended to be merged
refactoring
Refactoring
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
Control vectors in server
demo
Demonstrate some concept or idea, not intended to be merged
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#6289
opened Mar 24, 2024 by
trollkotze
Loading…
llama : compute BERT graph with F16 K, V
demo
Demonstrate some concept or idea, not intended to be merged
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
#5891
opened Mar 5, 2024 by
ggerganov
Loading…
server: feature Add Admin key parameter for slots/health/metrics
demo
Demonstrate some concept or idea, not intended to be merged
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
server/api
server
#5837
opened Mar 2, 2024 by
robeyh
Loading…
WIP: Add model Demonstrate some concept or idea, not intended to be merged
help wanted
Extra attention is needed
merge
example
demo
llama : add llama_kv_cache_compress
demo
Demonstrate some concept or idea, not intended to be merged
Server: add support for "tool_calls" (MeetKai/functionary model)
demo
Demonstrate some concept or idea, not intended to be merged
server/webui
llama : switch to floating-point token positions
demo
Demonstrate some concept or idea, not intended to be merged
refactoring
Refactoring
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
Add CUDA option to use the max release threshold for the default memory pool
demo
Demonstrate some concept or idea, not intended to be merged
#5429
opened Feb 9, 2024 by
YavorGIvanov
•
Draft
Layer skipping/self-speculation demo
demo
Demonstrate some concept or idea, not intended to be merged
research 🔬
#3565
opened Oct 10, 2023 by
KerfuffleV2
•
Draft
Adding SqueezeLLM Support
demo
Demonstrate some concept or idea, not intended to be merged
#3093
opened Sep 9, 2023 by
chooper1
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.