majentik
/

Leanstral-RotorQuant-MLX-8bit

Text Generation

kv-cache-quantization

weight-quantization

theorem-proving

Mixture of Experts

8-bit precision

Model card Files Files and versions

Leanstral-RotorQuant-MLX-8bit / tokenizer.json

Commit History

Add MLX 8-bit quantized model with KV cache compression

cfb84ae
verified

majentik commited on Apr 13