Shihan Qu
zenmagnets
AI & ML interests
None yet
Recent Activity
new activity about 9 hours ago
Qwen/Qwen3.5-397B-A17B: Qwen3.6 397b
new activity about 1 month ago
mmangkad/Qwen3.6-27B-NVFP4: 31gb NVFP4 Model?
liked a model about 1 month ago
NinjaBoffin/MiniMax-M2.7-NVFP4
Organizations
None yet
Qwen3.6 397b
3
#75 opened about 1 month ago
by
zenmagnets
31gb NVFP4 Model?
4
#1 opened about 1 month ago
by
zenmagnets
license
👀👍 9
16
#5 opened about 1 month ago
by
festr2
Pending GPU & vLLM validation
3
#1 opened about 2 months ago
by
nwzjk
No commercial use allowed in License?
👀😔 4
10
#6 opened about 1 month ago
by
zenmagnets
How to run on vLLM for 4xSM120
#1 opened 3 months ago
by
zenmagnets
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍 3
17
#1 opened 3 months ago
by
zenmagnets
Anyone get this working on 4x RTX 6000 Pro?
👀 2
5
#1 opened 3 months ago
by
zenmagnets
Throughput NVFP4 on Dual 6000 Blackwells
#2 opened 3 months ago
by
zenmagnets
Anyone try this on 4x RTX 6000 Pro yet?
52
#1 opened 3 months ago
by
zenmagnets
I wish it would fit in 2x6000 PRO!
1
#2 opened 3 months ago
by
mtcl
"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."
👍 1
21
#2 opened 3 months ago
by
zenmagnets
Wasn't able to recreate MMLU-Pro benchmarks
5
#5 opened 4 months ago
by
zenmagnets
Enormous KV-cache size?
👍➕ 6
23
#3 opened 4 months ago
by
nephepritou
Really appreciate that you ran performance comparison tests with BF16!
3
#2 opened 4 months ago
by
zenmagnets
Performance comps with BF16?
1
#3 opened 4 months ago
by
zenmagnets
Any plans for a 6bit or 8bit version?
1
#3 opened 4 months ago
by
zenmagnets
If 8bit, why shaped like 16 bit
2
#2 opened 4 months ago
by
zenmagnets
6 months since intro of NVFP4, and it's basically still a myth
1
#4 opened 6 months ago
by
zenmagnets
Works with vllm? Any recommendations or howtos?
7
#1 opened 7 months ago
by
DrRos