lm_head.weight is missing from weight_map in model.safetensors.index.json
#15 opened 6 days ago
by
zhaosiyuan
GIT_LFS_SKIP_SMUDGE=1 git clone [email protected]:Qwen/Qwen3-4B
#12 opened 3 months ago
by
Ahmedkash
Why is there a chat template for a base model?
1
#11 opened 4 months ago
by
dsouzaJithesh
Add assistant mask support to Qwen3-4B
#9 opened 5 months ago
by
waleko
UnslothVisionDataCollator problem
2
#8 opened 5 months ago
by
orkungedik
Translation task in low-resource language can be done pretty well
#7 opened 6 months ago
by
luweigen
Collections of Qwen3 4B model Bad Cases User Reviews and Comments
😔
1
#5 opened 6 months ago
by
DeepNLP
YaRN: is "performance" referring to quality or speed?
👀
1
#4 opened 6 months ago
by
kmouratidis
Use the more common reverse filter in template
#3 opened 6 months ago
by
tahayassine
【Evaluation】Best practice for evaluating Qwen3 !!
🔥
🚀
2
#2 opened 6 months ago
by
wangxingjun778
Add languages tag
#1 opened 6 months ago
by
de-francophones