RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_80
Safetensors
qwen2
No model card
Downloads last month: 14
Model size: 0.6B params
Tensor type: F32
Chat template
Inference Providers
This model isn't deployed by any Inference Provider.
Collection including RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_80:
Long_CoT_Degradation_RL (Collection, 119 items, updated 9 days ago)