Run Qwen3-VL-4B-Instruct optimized for Apple Silicon on MLX with NexaSDK.
Install NexaSDK
Run the model locally with one line of code:
nexa infer NexaAI/qwen3vl-4B-fp16-mlx
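To launch the same command from a script, a thin wrapper such as the one below may be enough. This is a minimal sketch, assuming that installing NexaSDK puts a `nexa` executable on your PATH; the helper names `build_infer_command` and `run_model` are hypothetical and not part of NexaSDK.

```python
import shutil
import subprocess

# Model name taken from the one-line command above.
MODEL_ID = "NexaAI/qwen3vl-4B-fp16-mlx"


def build_infer_command(model_id: str = MODEL_ID) -> list[str]:
    """Return the argument list equivalent to `nexa infer <model_id>`."""
    return ["nexa", "infer", model_id]


def run_model(model_id: str = MODEL_ID) -> int:
    """Start the CLI session and return its exit code.

    Assumption: the `nexa` executable is on PATH after installing NexaSDK.
    """
    if shutil.which("nexa") is None:
        raise FileNotFoundError("`nexa` not found on PATH; install NexaSDK first")
    # The child process inherits stdin/stdout, so the session stays interactive.
    return subprocess.run(build_infer_command(model_id)).returncode


if __name__ == "__main__":
    raise SystemExit(run_model())
```

Checking for the executable first turns a missing installation into a clear error message rather than a generic failure from the subprocess call.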
Qwen3-VL-4B-Instruct is a 4-billion-parameter instruction-tuned multimodal large language model from Alibaba Cloud’s Qwen team.
As part of the Qwen3-VL series, it combines vision-language understanding with conversational fine-tuning and targets real-world applications such as chat-based reasoning, document analysis, and visual dialogue.
The Instruct variant is tuned to follow user prompts naturally and safely, producing concise, relevant, and user-aligned responses across text, image, and video contexts.
Input:
Output:
Refer to the official Qwen license for terms of use and redistribution.
Base model: Qwen/Qwen3-VL-4B-Instruct