Qwen3VL
Collection
Nexa AI infra to support Qwen3VL running on GPU/NPU/CPU
•
22 items
•
Updated
•
3
Currently, only NexaSDK supports this GGUF.
nexa infer NexaAI/Qwen3-VL-2B-Instruct-GGUF
Qwen3-VL-2B-Instruct is a 2-billion-parameter, instruction-tuned vision-language model in the Qwen3-VL family. It’s designed for efficient multimodal understanding and generation—combining strong text skills with image and video perception—making it ideal for edge and on-device deployment. It supports long contexts (up to 256K tokens) and features upgraded architecture for better spatial, visual, and temporal reasoning.
Inputs
Outputs
This model is released under the Apache 2.0 License.
Please refer to the Hugging Face model card for detailed licensing and usage information.
6-bit
8-bit
16-bit
32-bit
Base model
Qwen/Qwen3-VL-2B-Instruct