CPU only?

#2
by jujutechnology - opened

Is it true that this runs on cpu only?

Intel org
β€’
edited Sep 17

CPU/CUDA

Intel org

Is it possible to pack Qwen3-Next-80B-A3B-Thinking-int4-AutoRound into Openvino format?

Would be the icing on the cake

Intel org

Is it possible to pack Qwen3-Next-80B-A3B-Thinking-int4-AutoRound into Openvino format?

Would be the icing on the cake

Suggest adding the support of recognizing the autoround format (see the bottom lines of config.json), which is compatible with GPTQ. Meanwhile, you may consider supporting the mixed precision: MoE Linear INT4, and non-MoE linears INT8. Happy to collaborate to enable AutoRound into OpenVINO!

Sign up or log in to comment