PySpect

Home

lists

Frequently asked questions

© 2025 PySpect

Package profile

gptqmodel

  • Summary: Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
  • Author: ModelCloud
  • Homepage: https://github.com/ModelCloud/GPTQModel
  • Source: https://github.com/ModelCloud/GPTQModel (Repo profile)
  • Number of releases: 35
  • First release: 1.0.1 on 2024-08-15
  • Latest release: 2.2.0 on 2025-04-03

Releases

Dates and sizes of releasesOctober2025AprilJulyRelease Date0.160.180.200.220.240.260.280.300.32Size in MB

PyPI Downloads

Weekly downloads over the last 3 monthsFebruaryMarchAprilMayJuneDate0123456789 thousand downloads per week

Dependencies

Gptqmodel has 21 dependencies, 21 of which optional.
Dependencies of gptqmodel (21).
DependencyOptional
auto_roundtrue
bitblastrue
clearmltrue
evalplustrue
fastapitrue
flashinfer-pythontrue
intel_extension_for_pytorchtrue
isorttrue
lm_evaltrue
mlx_lmtrue
optimumtrue
parameterizedtrue
plotlytrue
pydantictrue
pytesttrue
random_wordtrue
rufftrue
sglangtrue
tritontrue
uvicorntrue
vllmtrue

Details