Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
purbeshmitra
/
vanillaGRPO
like
0
Text Generation
Transformers
Safetensors
openai/gsm8k
HuggingFaceH4/MATH-500
HuggingFaceH4/aime_2024
English
arxiv:
2507.02851
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
vanillaGRPO
/
adapter_model.safetensors
Commit History
Upload 3 files
eb940ae
verified
purbeshmitra
commited on
Jul 6