GuidedQuant

Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025).

Smarter LLM Post-Training Quantization using End Loss Guidance, boosting the performance of state-of-the-art weight-only scalar, weight-only vector, and weight-and-activation quantization methods.

News

  • May 2025: GuidedQuant is accepted to ICML 2025.

Overview


GuidedQuant enhances LLM quantization by integrating gradient information from the end loss into the quantization objective, boosting the performance of SOTA weight-only scalar, weight-only vector, and weight-and-activation quantization. Additionally, we introduce LNQ, a non-uniform scalar quantization algorithm which is guaranteed to monotonically decrease the quantization objective value.
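
To make the core idea concrete, here is a minimal, hypothetical PyTorch sketch of a gradient-weighted layer-wise quantization objective in this spirit. It is an illustration only, not the repository's code (which is not yet released): the function name, tensor shapes, and the exact squared-gradient weighting are assumptions.

import torch

def guided_layer_objective(W, W_q, X, g):
    """Gradient-weighted layer-wise reconstruction error (illustrative).

    Each entry of the layer's output perturbation is weighted by the
    end-loss gradient at that position, so quantization error that the
    end loss is sensitive to is penalized more heavily.

    W   : (d_out, d_in) original weight matrix
    W_q : (d_out, d_in) quantized weight matrix
    X   : (n, d_in)     calibration inputs to the layer
    g   : (n, d_out)    end-loss gradients w.r.t. the layer outputs
    """
    E = X @ (W - W_q).T                  # (n, d_out) output perturbation
    return ((g * E) ** 2).sum()

# Toy usage with random data and a naive round-to-grid stand-in quantizer.
torch.manual_seed(0)
W = torch.randn(8, 16)
W_q = torch.round(W * 4) / 4             # stand-in for a real quantizer
X = torch.randn(32, 16)
g = torch.randn(32, 8)                   # stand-in for end-loss gradients
print(guided_layer_objective(W, W_q, X, g))

Compared with the plain reconstruction error ||X (W - W_q)^T||^2, the gradient weighting steers the quantizer toward preserving the output coordinates that most affect the end loss.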

Installation & Usage

To be released soon.

Citation

Please cite our paper if you find our work useful:

@inproceedings{kim2025guidedquant,
  title={GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance},
  author={Jinuk Kim and Marwa El Halabi and Wonpyo Park and Clemens JS Schaefer and Deokjae Lee and Yeonhong Park and Jae W. Lee and Hyun Oh Song},
  booktitle={International Conference on Machine Learning (ICML)},
  year={2025},
}
