Package profile

gptqmodel

Summary: Production-ready LLM compression/quantization toolkit with hardware-accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
Author: ModelCloud
Homepage: https://github.com/ModelCloud/GPTQModel
Source: https://github.com/ModelCloud/GPTQModel
Number of releases: 36
First release: 1.0.1 on 2024-08-15
Latest release: 4.0.0 on 2025-08-22

gptqmodel has 21 dependencies, all 21 of which are optional.