Request: Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
#1 opened by jart25
Hi! Would it be possible to publish this variant, please? It would be greatly appreciated. Many thanks for your time QuantTrio!
The accuracy loss from Int4 quantization is quite significant.
Same request, please? It would be greatly appreciated. Many thanks!
@koesn posted a version here:
https://huggingface.co/jart25/Qwen3-30B-A3B-Instruct-2507-Autoround-Int-4bit-gptq
jart25 changed discussion status to closed
It doesn't work. That checkpoint uses group size 32, but LMDeploy only supports group size 128.
Our models are built with group size 128
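For anyone hitting the same mismatch: you can check a GPTQ checkpoint's group size in its `config.json` before trying to serve it. A minimal sketch below, assuming a typical `quantization_config` layout (the inline dict is a hypothetical excerpt, and the group-size-128 requirement is the LMDeploy constraint mentioned above):

```python
# Sketch: verify a GPTQ checkpoint's group size before loading it in LMDeploy.
# The config dict is a hypothetical excerpt of a model's config.json;
# in practice you would json.load() the real file from the model directory.
LMDEPLOY_SUPPORTED_GROUP_SIZE = 128

def lmdeploy_compatible(cfg: dict) -> bool:
    """Return True if the checkpoint's GPTQ group size matches LMDeploy's requirement."""
    qc = cfg.get("quantization_config", {})
    return (
        qc.get("quant_method") == "gptq"
        and qc.get("group_size") == LMDEPLOY_SUPPORTED_GROUP_SIZE
    )

# The rejected checkpoint from the linked repo used group size 32:
config = {
    "quantization_config": {
        "quant_method": "gptq",
        "bits": 4,
        "group_size": 32,
    }
}

print(lmdeploy_compatible(config))  # False: group size 32 is not supported
```

A group-size-128 build (as QuantTrio's models use) would pass the same check.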