This repository was archived by the owner on May 11, 2025. It is now read-only.

Importance Matrix calculation for AWQ as well? #305

Closed as not planned
@sorasoras

ggml-org/llama.cpp#4856
ggml-org/llama.cpp#4861

There might be some interesting ideas in these two PRs that you could use to improve AutoAWQ, or the two approaches could be combined to generate a quantization.
It looks pretty promising.
