Qwen3-0.6B-Thinking-GGUF : GGUF
This model was finetuned and converted to GGUF format using Unsloth.
Example usage:
- For text only LLMs:
./llama.cpp/llama-cli -hf Jackrong/Qwen3-0.6B-Thinking-GGUF --jinja - For multimodal models:
./llama.cpp/llama-mtmd-cli -hf Jackrong/Qwen3-0.6B-Thinking-GGUF --jinja
Available Model files:
qwen3-0.6b.Q8_0.gguf
Ollama
An Ollama Modelfile is included for easy deployment.
This was trained 2x faster with Unsloth

- Downloads last month
- 75
Hardware compatibility
Log In to add your hardware
4-bit
8-bit
16-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support