The new QX_K_M quants are producing gibberish #1091

Closed
@BadisG

Hello,

I was trying this model that has the new types of GGUF quants:
https://huggingface.co/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-GGUF/tree/main

It doesn't work with Q5_K_M; the model produces gibberish:
image

This bug was supposed to have been fixed 2 days ago here:
ggml-org/llama.cpp#4927

I was using your latest version (v0.2.29), which was bumped 11 hours ago, so I expected it to work, but it doesn't.
