DavidAU/Qwen3.5-13B-Strict-Instruct

#2025
by VaLtEc-BoY - opened

IMPORTANT: This model has an upgraded Jinja template which repairs issues with org model (repeats, long thinking, loops) and upgrades/repairs to tool handling.

Qwen3.5-13B-Strict-Instruct
This model contains TWO fine tunes (Deepseek 9B, Polaris 9B), merged AND expanded to 13B parameters (48 layers, 639 tensors).

This has altered reasoning/thinking block as well as thinking/reasoning block size. (reduced / improved)

This has also deepened model logic, and thinking too.

This is a BETA version, as further training on this merged model may improve quality/metrics.

The model's default mode however is "INSTRUCT" ; this can be changed by editing the following line in the jinja template:

{%- set enable_thinking = false %}

to

{%- set enable_thinking = true %}

(In LMStudio, this can be edited after loading the model, in dev mode -> template)

Every attempt was made to ensure the training was "mild" and did not negatively affect the model's already incrediblely strong benchmarks.

Vision (images) tested -> working with new training.

ONE example generation at the bottom of this page.

I was a bit confused with the request, but I queued

You can check for progress at http://hf.tst.eu/status.html or regularly check the model
summary page at https://hf.tst.eu/model#Qwen3.5-13B-Strict-Instruct-GGUF for quants to appear.

Sign up or log in to comment