Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ermiaazarkhalili
/
llama-3.2-1b-instruct_grpo-GSM8K
like
0
Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
unsloth
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llama-3.2-1b-instruct_grpo-GSM8K
Commit History
(Trained with Unsloth)
4d3c38d
verified
ermiaazarkhalili
commited on
Jul 4
Unsloth Model Card
9dc36eb
verified
ermiaazarkhalili
commited on
Jul 4
Update README.md
e893fae
verified
ermiaazarkhalili
commited on
Jul 4
Update README.md
d8e95f8
verified
ermiaazarkhalili
commited on
Jun 29
Update README.md
9f625f1
verified
ermiaazarkhalili
commited on
Jun 29
Update README.md
95ee9a4
verified
ermiaazarkhalili
commited on
Jun 29
(Trained with Unsloth)
d94cc90
verified
ermiaazarkhalili
commited on
Jun 13
(Trained with Unsloth)
d16bb43
verified
ermiaazarkhalili
commited on
Jun 13
Unsloth Model Card
8a37cfc
verified
ermiaazarkhalili
commited on
Jun 13
initial commit
ca6e52d
verified
ermiaazarkhalili
commited on
Jun 13