Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Cache based tokenization for the server input prompts demo Demonstrate some concept or idea, not intended to be merged examples server
#12067 opened Feb 25, 2025 by vnicolici Loading…
server webui easy config selection demo Demonstrate some concept or idea, not intended to be merged examples server
#12031 opened Feb 22, 2025 by poulphunter Loading…
added rudimentary support for outetts v0.3 500m and 1b models demo Demonstrate some concept or idea, not intended to be merged examples
#11287 opened Jan 18, 2025 by LostRuins Loading…
Refactor/tinyblas build Compilation issues demo Demonstrate some concept or idea, not intended to be merged documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#10343 opened Nov 16, 2024 by Djip007 Draft
2 of 4 tasks
speculative : experiments with Qwen2.5-Coder demo Demonstrate some concept or idea, not intended to be merged examples
#10290 opened Nov 14, 2024 by ggerganov Draft
main : add new feature: special commands demo Demonstrate some concept or idea, not intended to be merged examples
#10145 opened Nov 3, 2024 by ngxson Draft
2 tasks done
Quantize: specify each major tensor quant in CLI for common LLMs demo Demonstrate some concept or idea, not intended to be merged examples Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#8917 opened Aug 7, 2024 by Nexesenex Draft
2 of 4 tasks
build example/main.cpp as shared library and intercept token printing using FFI demo Demonstrate some concept or idea, not intended to be merged examples Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#8339 opened Jul 6, 2024 by mtasic85 Loading…
Implemented Spellcheck for Llama.cpp demo Demonstrate some concept or idea, not intended to be merged examples Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7884 opened Jun 11, 2024 by Ferruolo Loading…
2 of 4 tasks
Direct I/O and Transparent HugePages demo Demonstrate some concept or idea, not intended to be merged examples python python script changes Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level script Script related server
#7420 opened May 20, 2024 by pavelfatin Loading…
Updated server_queue to delete tasks from queue when server is shutdown. Feature Request #6421 demo Demonstrate some concept or idea, not intended to be merged Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#6941 opened Apr 27, 2024 by rahsuri Loading…
support MiniCPM-V-2 demo Demonstrate some concept or idea, not intended to be merged enhancement New feature or request examples python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#6919 opened Apr 26, 2024 by Achazwl Loading…
Refactor chat template API demo Demonstrate some concept or idea, not intended to be merged refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#6822 opened Apr 22, 2024 by ngxson Draft
Control vectors in server demo Demonstrate some concept or idea, not intended to be merged Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#6289 opened Mar 24, 2024 by trollkotze Loading…
llama : compute BERT graph with F16 K, V demo Demonstrate some concept or idea, not intended to be merged Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#5891 opened Mar 5, 2024 by ggerganov Loading…
IQ3_S: multiplier based code book demo Demonstrate some concept or idea, not intended to be merged
#5867 opened Mar 4, 2024 by ikawrakow Draft
server: feature Add Admin key parameter for slots/health/metrics demo Demonstrate some concept or idea, not intended to be merged Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level server/api server
#5837 opened Mar 2, 2024 by robeyh Loading…
WIP: Add model merge example demo Demonstrate some concept or idea, not intended to be merged help wanted Extra attention is needed
#5741 opened Feb 26, 2024 by ngxson Draft
llama : add llama_kv_cache_compress demo Demonstrate some concept or idea, not intended to be merged
#5719 opened Feb 25, 2024 by ggerganov Draft
Server: add support for "tool_calls" (MeetKai/functionary model) demo Demonstrate some concept or idea, not intended to be merged server/webui
#5695 opened Feb 23, 2024 by ngxson Draft
llama : switch to floating-point token positions demo Demonstrate some concept or idea, not intended to be merged refactoring Refactoring Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#5679 opened Feb 23, 2024 by ggerganov Draft
Add CUDA option to use the max release threshold for the default memory pool demo Demonstrate some concept or idea, not intended to be merged
#5429 opened Feb 9, 2024 by YavorGIvanov Draft
Layer skipping/self-speculation demo demo Demonstrate some concept or idea, not intended to be merged research 🔬
#3565 opened Oct 10, 2023 by KerfuffleV2 Draft
llama : store non-RoPEd K cache demo Demonstrate some concept or idea, not intended to be merged
#3234 opened Sep 17, 2023 by ggerganov Draft
Adding SqueezeLLM Support demo Demonstrate some concept or idea, not intended to be merged
#3093 opened Sep 9, 2023 by chooper1 Loading…
ProTip! no:milestone will show everything without a milestone.