Qwen/Qwen3-1.7B-FP8
Tags: Text Generation, Transformers, Safetensors, qwen3, conversational, text-generation-inference, fp8
arXiv: 2505.09388
License: apache-2.0
Files and versions
Revision: d242af4
Repository root: Qwen3-1.7B-FP8
Total size: 2.67 GB
Contributors: 6
History: 11 commits
Latest commit: "Remove vLLM FP8 Limitation" by simon-mo (d242af4, verified), 7 months ago
File                     Size        Storage  Last commit                           Updated
.gitattributes           1.57 kB              Upload folder using huggingface_hub   7 months ago
README.md                14.7 kB              Remove vLLM FP8 Limitation            7 months ago
config.json              894 Bytes            Upload folder using huggingface_hub   7 months ago
generation_config.json   239 Bytes            Update generation_config.json         7 months ago
merges.txt               1.67 MB              Upload folder using huggingface_hub   7 months ago
model.safetensors        2.65 GB     xet      Upload folder using huggingface_hub   7 months ago
tokenizer.json           11.4 MB     xet      Upload folder using huggingface_hub   7 months ago
tokenizer_config.json    9.68 kB              Upload folder using huggingface_hub   7 months ago
vocab.json               2.78 MB              Upload folder using huggingface_hub   7 months ago
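
As a sanity check, the per-file sizes in the listing can be added up and compared with the 2.67 GB shown for the repository. The sketch below is only illustrative: it assumes the page uses decimal units (1 kB = 1000 bytes) and that the displayed values are rounded, and the names `listed_sizes` and `to_bytes` are hypothetical helpers, not part of any Hugging Face API.

```python
# Sizes copied from the file listing above.
# Assumption: decimal units (1 kB = 1000 bytes); displayed values are rounded.
UNIT = {"Bytes": 1, "kB": 1e3, "MB": 1e6, "GB": 1e9}

listed_sizes = {
    ".gitattributes": "1.57 kB",
    "README.md": "14.7 kB",
    "config.json": "894 Bytes",
    "generation_config.json": "239 Bytes",
    "merges.txt": "1.67 MB",
    "model.safetensors": "2.65 GB",
    "tokenizer.json": "11.4 MB",
    "tokenizer_config.json": "9.68 kB",
    "vocab.json": "2.78 MB",
}


def to_bytes(size: str) -> float:
    """Convert a display string such as '1.57 kB' to an approximate byte count."""
    value, unit = size.split()
    return float(value) * UNIT[unit]


# Sum all files and express the total in GB, rounded like the page does.
total_bytes = sum(to_bytes(s) for s in listed_sizes.values())
total_gb = round(total_bytes / UNIT["GB"], 2)
print(f"{total_gb} GB")
```

Under these assumptions the sum comes to roughly 2.67 GB, almost all of it from model.safetensors; the tokenizer and vocabulary files together contribute only about 16 MB.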