Article Introducing AutoRound: Intel's Advanced Quantization for LLMs and VLMs Apr 29 • 40
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023 • 10