PySpect

Home

lists

Frequently asked questions

© 2025 PySpect

Repository info

GPTQModel

Summary: a summary of GPTQModel
Description: LLM model compression/quantization toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
Stars: 767
Number of forks: 112