Original model: dervig/m51Lab-MiniMax-M2.7-REAP-139B-A10B

name: MiniMax-M2.7-REAP-139B-A10B-5bit
base_model: MiniMaxAI/MiniMax-M2.7
license: other
license_name: modified-mit
license_link: https://huggingface.co/MiniMaxAI/MiniMax-M2.7/blob/main/LICENSE
pipeline_tag: text-generation
tasks: text-generation
language: en
library_name: mlx
size: 95GB
tags:
- Cerebras
- MiniMaxAI
- M2.7
- REAP
- MLX
- static quantization
- 5-bit
- moe
- pruning
- text-generation
- mlx

Description

This is the 230-billion-parameter MiniMax M2.7 model with 40% of its experts pruned using REAP (Router-weighted Expert Activation Pruning), which leaves about 139 billion parameters. The pruned model was then quantized to 5 bits and converted to MLX with mlx_lm.

Conversion command, using mlx_lm 0.31.3 installed from source:

mlx_lm.convert --hf-path dervig/m51Lab-MiniMax-M2.7-REAP-139B-A10B --mlx-path ~/Downloads/MiniMax-M2.7-REAP-139B-A10B-MLX-5bit -q --q-bits 5
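
As a rough sanity check on the 95GB size listed in the metadata, the sketch below estimates the on-disk size of a 5-bit quantized 139B-parameter model. It rests on assumptions that are not stated in this card: that the conversion used a quantization group size of 64, that each group stores a 16-bit scale and a 16-bit bias, and that every parameter is quantized. In practice some tensors stay in BF16, so the real figure will differ slightly. The function name `estimate_quantized_size_gb` is hypothetical and is not part of mlx_lm.

```python
def estimate_quantized_size_gb(n_params, bits, group_size=64,
                               scale_bits=16, bias_bits=16):
    """Approximate size in decimal GB of a group-wise quantized model.

    Assumes every parameter is quantized and that each group of
    `group_size` weights carries one scale and one bias (assumed values).
    """
    # Effective bits per weight = payload bits + amortized per-group overhead.
    bits_per_weight = bits + (scale_bits + bias_bits) / group_size
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 1e9


# 139B parameters at 5 bits, assumed group size of 64.
size_gb = estimate_quantized_size_gb(139e9, bits=5)
print(f"{size_gb:.1f} GB")  # prints "95.6 GB"
```

Under these assumptions the effective rate is 5.5 bits per weight, which lands close to the listed 95GB.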

Model size: 139B params (Safetensors)
Tensor types: BF16, U32
Model repository: exdysa/MiniMax-M2.7-REAP-139B-A10B-MLX-5bit