metadata
license: apache-2.0
base_model: Qwen/Qwen3-Next-80B-A3B-Thinking
base_model_relation: quantized
quantized_by: turboderp
tags:
  - exl3

EXL3 quants of Qwen3-Next-80B-A3B-Thinking

⚠️ Requires ExLlamaV3 v0.0.7 (or v0.0.6 dev branch)

Base bitrates:

2.00 bits per weight
3.00 bits per weight
4.00 bits per weight
5.00 bits per weight
6.00 bits per weight

Optimized:

2.08 bits per weight
2.27 bits per weight
2.78 bits per weight
3.14 bits per weight
3.53 bits per weight
4.06 bits per weight
4.51 bits per weight
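
As a rough guide to picking a bitrate, the weight footprint can be estimated from bits per weight. The sketch below assumes about 80e9 parameters (inferred from the "80B" in the model name, not stated in this card) and that every parameter is stored at the listed average bitrate. Actual file sizes and VRAM use will differ, since this ignores cache, activations, and any per-file overhead. `PARAMS` and `approx_size_gb` are illustrative names, not part of ExLlamaV3.

```python
# Back-of-the-envelope weight size for each bitrate listed above.
# Assumption: ~80e9 parameters, inferred from the "80B" in the model name.
PARAMS = 80e9

# Base and optimized bitrates from this card, in bits per weight.
BITRATES = [2.00, 3.00, 4.00, 5.00, 6.00,
            2.08, 2.27, 2.78, 3.14, 3.53, 4.06, 4.51]


def approx_size_gb(bpw: float, params: float = PARAMS) -> float:
    """Approximate weight size in GB (1 GB = 10^9 bytes)."""
    return params * bpw / 8 / 1e9


if __name__ == "__main__":
    for bpw in sorted(BITRATES):
        print(f"{bpw:.2f} bpw ~ {approx_size_gb(bpw):.1f} GB")
```

Treat the numbers as a lower bound when checking whether a quant fits in available memory.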

[Figure: KLD (KL divergence) chart]