PySpect

Home

lists

Frequently asked questions

© 2025 PySpect

Package profile

gptqmodel

  • Summary: Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
  • Author: ModelCloud
  • Homepage: https://github.com/ModelCloud/GPTQModel
  • Source: https://github.com/ModelCloud/GPTQModel (Repo profile)
  • Number of releases: 36
  • First release: 1.0.1 on 2024-08-15
  • Latest release: 4.0.0 on 2025-08-22

Releases

Dates and sizes of releasesOctober2025AprilJulyRelease Date0.150.200.250.30Size in MB

PyPI Downloads

Loading PyPI statistics...

Dependencies

Gptqmodel has 21 dependencies, 21 of which optional.
Dependencies of gptqmodel (21).
DependencyOptional
auto_roundtrue
bitblastrue
clearmltrue
evalplustrue
fastapitrue
flashinfer-pythontrue
intel_extension_for_pytorchtrue
isorttrue
lm_evaltrue
mlx_lmtrue
optimumtrue
parameterizedtrue
plotlytrue
pydantictrue
pytesttrue
random_wordtrue
rufftrue
sglangtrue
tritontrue
uvicorntrue
vllmtrue

Details