PySpect

© 2025 PySpect

Repository info

GPTQModel

  • Summary: a summary of GPTQModel
  • Description: LLM compression/quantization toolkit with hardware acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU, and Intel/AMD/Apple CPUs via Hugging Face (HF), vLLM, and SGLang.
  • Stars: 767
  • Number of forks: 112