Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kashif
/
Qwen3-0.6B-h128-l4-a16-ctx256-pred64-vocab4096-MeanScaleUniform-lr1.0e-05-bs16-steps1000
like
0
Text Generation
Transformers
Safetensors
qwen3
Generated from Trainer
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen3-0.6B-h128-l4-a16-ctx256-pred64-vocab4096-MeanScaleUniform-lr1.0e-05-bs16-steps1000
/
training_config.json
Commit History
Upload 4 files
074b52e
verified
kashif
HF Staff
commited on
Jun 26