PySpect

Home

lists

Frequently asked questions

© 2025 PySpect

Repository info

auto-round

  • Summary: a summary of auto-round
  • Description: Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM. Export your models effortlessly to autogptq, autoawq, gguf and autoround formats with high accuracy even at extremely low bit precision.
  • Stars: 611
  • Number of forks: 52