Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
purbeshmitra
/
vanillaGRPO
like
0
Text Generation
Transformers
Safetensors
openai/gsm8k
HuggingFaceH4/MATH-500
HuggingFaceH4/aime_2024
English
arxiv:
2507.02851
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
main
vanillaGRPO
148 MB
2 contributors
History:
13 commits
purbeshmitra
Update README.md
12d19a4
verified
4 months ago
assets
Rename multiround.png to assets/multiround.png
4 months ago
.gitattributes
Safe
1.75 kB
Rename multiround.png to assets/multiround.png
4 months ago
README.md
Safe
2.6 kB
Update README.md
4 months ago
adapter_config.json
Safe
876 Bytes
Upload 3 files
4 months ago
adapter_model.safetensors
Safe
148 MB
xet
Upload 3 files
4 months ago