LLucass/TT_L0.2_H0.2_dr_grpo
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
TT_L0.2_H0.2_dr_grpo
Commit History
End of training (bc8969e, verified), LLucass committed on Jun 8
Model save (aa5f5a3, verified), LLucass committed on Jun 8
Training in progress, step 200, checkpoint (fd4eeef, verified), LLucass committed on Jun 8
Training in progress, step 200 (be242b8, verified), LLucass committed on Jun 8
Training in progress, step 150, checkpoint (5b530eb, verified), LLucass committed on Jun 8
Training in progress, step 150 (ab298bb, verified), LLucass committed on Jun 8
Training in progress, step 100, checkpoint (dbed49b, verified), LLucass committed on Jun 8
Training in progress, step 100 (2dab727, verified), LLucass committed on Jun 8
Training in progress, step 50, checkpoint (1b6f6d0, verified), LLucass committed on Jun 8
Training in progress, step 50 (2e21f0d, verified), LLucass committed on Jun 8
initial commit (39d8928, verified), LLucass committed on Jun 8