This repository was archived by the owner on May 11, 2025. It is now read-only.
This repository was archived by the owner on May 11, 2025. It is now read-only.
Importance Matrix calculation for AWQ as well? #305
Closed as not planned
Description
ggml-org/llama.cpp#4856
ggml-org/llama.cpp#4861
There might be some interesting idea you can use to improve AutoAWQ
It does looks pretty promising
or combine them to generate a quantization.
Metadata
Metadata
Assignees
Labels
No labels