Repository info
GPTQModel
- Summary: a summary of GPTQModel
- Description: LLM model compression/quantization toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
- Stars: 767
- Number of forks: 112