GIT_FEED

ModelCloud/GPTQModel

LLM quantization (compression) toolkit with hardware acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU, and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

0 Active

On the radar — signal detected

Stars: 1.1k
Forks: 168
Contributors: 84
Language: Python

Score updated Mar 26, 2026
