Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sayannath
/
Qwen2-0.5B-GRPO-test
like
0
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2-0.5B-GRPO-test
Commit History
Model save
e27093a
verified
sayannath
commited on
Sep 16
Training in progress, step 226
8827784
verified
sayannath
commited on
Sep 16
Training in progress, step 220
394169b
verified
sayannath
commited on
Sep 16
Training in progress, step 210
384bf17
verified
sayannath
commited on
Sep 16
Training in progress, step 200
05af71e
verified
sayannath
commited on
Sep 16
Training in progress, step 190
f5efa35
verified
sayannath
commited on
Sep 16
Training in progress, step 180
fc42b07
verified
sayannath
commited on
Sep 16
Training in progress, step 170
e6f1640
verified
sayannath
commited on
Sep 16
Training in progress, step 160
f028a2e
verified
sayannath
commited on
Sep 16
Training in progress, step 150
72b70ab
verified
sayannath
commited on
Sep 16
Training in progress, step 140
3ea4ed2
verified
sayannath
commited on
Sep 16
Training in progress, step 130
79b4a2f
verified
sayannath
commited on
Sep 16
Training in progress, step 120
a91c123
verified
sayannath
commited on
Sep 16
Training in progress, step 110
2fcca4f
verified
sayannath
commited on
Sep 16
Training in progress, step 100
1abbf55
verified
sayannath
commited on
Sep 16
Training in progress, step 90
b404e23
verified
sayannath
commited on
Sep 16
Training in progress, step 80
1a6683f
verified
sayannath
commited on
Sep 16
Training in progress, step 70
bee082f
verified
sayannath
commited on
Sep 16
Training in progress, step 60
dd8fbf2
verified
sayannath
commited on
Sep 16
Training in progress, step 50
c0ab3a8
verified
sayannath
commited on
Sep 16
Training in progress, step 40
7f3cfc3
verified
sayannath
commited on
Sep 16
Training in progress, step 30
38621f5
verified
sayannath
commited on
Sep 16
Training in progress, step 20
a1c7af7
verified
sayannath
commited on
Sep 15
Training in progress, step 10
a7ef51d
verified
sayannath
commited on
Sep 15
initial commit
a364d19
verified
sayannath
commited on
Sep 15