Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023 • 11
TeichAI/Qwen3-4B-Thinking-2507-MiniMax-M2.1-Distill Text Generation • 4B • Updated 6 days ago • 75 • 1