imatrix : use GGUF to store importance matrices #9400

Draft: wants to merge 20 commits into base: master

20 commits
bce5464
imatrix : allow processing multiple chunks per batch
compilade Aug 20, 2024
347247a
imatrix : fix segfault when using a single chunk per batch
compilade Aug 20, 2024
3de9300
imatrix : use GGUF to store imatrix data
compilade Sep 6, 2024
c8ab6a3
imatrix : fix conversion problems
compilade Sep 8, 2024
3ad0603
Merge branch 'master' into compilade/imatrix-batched-chunks
compilade Sep 8, 2024
d19101c
imatrix : use FMA and sort tensor names
compilade Sep 8, 2024
503630e
py : add requirements for legacy imatrix convert script
compilade Sep 10, 2024
9e6b0e9
perplexity : revert changes
compilade Sep 10, 2024
894ed8d
py : include imatrix converter requirements in toplevel requirements
compilade Sep 10, 2024
efa9186
imatrix : avoid using designated initializers in C++
compilade Sep 10, 2024
2217247
imatrix : remove unused n_entries
compilade Sep 10, 2024
8c13e16
imatrix : allow loading mis-ordered tensors
compilade Sep 10, 2024
2d79a70
quantize : use unused imatrix chunk_size with LLAMA_TRACE
compilade Sep 10, 2024
c7a32e7
common : use GGUF for imatrix output by default
compilade Jan 31, 2025
db502dd
Merge branch 'master' into compilade/imatrix-batched-chunks
compilade Feb 9, 2025
1be357d
Merge branch 'master' into compilade/imatrix-batched-chunks
compilade Feb 9, 2025
16202d6
Merge branch 'master' into compilade/imatrix-batched-chunks
compilade Apr 13, 2025
a5165a6
imatrix : two-way conversion between old format and GGUF
compilade Apr 15, 2025
635f945
convert : remove imatrix to gguf python script
compilade Apr 15, 2025
1d19025
imatrix : use the function name in more error messages
compilade Apr 15, 2025