Qwen / Qwen3.5-0.8B

QUANTIZED BY: UnstableLlama
Information
⚠️ Requires ExLlamaV3 v0.0.23 (or v0.0.22 `dev` branch)
exl3 quantizations of Qwen3.5-0.8B via exllamav3.
repo generated automatically with ezexl3.
Repo Data
Quantization graph
REVISION GiB KL DIV PPL
2.10bpw 0.96 0.4642 26.2416
3.10bpw 1.02 0.0800 19.0087
4.00bpw 1.07 0.0238 17.8643
5.00bpw 1.13 0.0067 17.8094
6.00bpw 1.19 0.0023 17.6048
8.00bpw 1.31 0.0009 17.6475
bf16 1.63 0.0000 17.6029
CLI Download
huggingface-cli download UnstableLlama/Qwen3.5-0.8B-exl3 --revision "4.00bpw" --local-dir ./
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for UnstableLlama/Qwen3.5-0.8B-exl3

Finetuned
Qwen/Qwen3.5-0.8B
Quantized
(54)
this model