https://huggingface.co/inclusionAI/Ring-mini-2.0
#1381
by
nisten
- opened
this is a 1B active 16B moe model, not sure it will work but worth quantising never the less.
This model is of architecture BailingMoeV2ForCausalLM which is unfortunately not an architecture currently supported by llama.cpp