Read news on bit quantization with our app.
Read more in the app
SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
Run DeepSeek R1 Dynamic 1.58-bit
VPTQ: Extreme low-bit Quantization for real LLMs