Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen3-8B-FP8
like
44
Follow
Qwen
55.2k
Text Generation
Transformers
Safetensors
qwen3
conversational
text-generation-inference
fp8
arxiv:
2309.00071
arxiv:
2505.09388
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
2
Train
Deploy
Use this model
main
Qwen3-8B-FP8
Commit History
Create LICENSE
220b46e
verified
littlebird13
commited on
Jul 26
Update README.md
2df580c
verified
littlebird13
commited on
May 21
update tokenizer_config.json
a8f74df
feihu.hf
commited on
May 19
Remove vLLM FP8 Limitation (
#2
)
a29cae3
verified
jklj077
simon-mo
commited on
Apr 30
Update README.md
52c6f34
verified
yangapku
commited on
Apr 29
Update README.md
dc92dcc
verified
yangapku
commited on
Apr 28
Update README.md
c79002b
verified
littlebird13
commited on
Apr 28
Update README.md
3afac07
verified
jklj077
commited on
Apr 28
Delete special_tokens_map.json
02d9604
verified
littlebird13
commited on
Apr 28
Delete added_tokens.json
136b5b1
verified
littlebird13
commited on
Apr 28
Update generation_config.json
add3e8d
verified
littlebird13
commited on
Apr 28
Update README.md
7bc1816
verified
littlebird13
commited on
Apr 28
Update README.md
8b02ae0
verified
littlebird13
commited on
Apr 28
Upload folder using huggingface_hub
18a569c
verified
littlebird13
commited on
Apr 28
initial commit
e07edb6
verified
littlebird13
commited on
Apr 28