Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Menlo
/
ReZero-v0.1-llama-3.2-3b-it-grpo-250404-gguf
like
4
Follow
Menlo Research
638
Transformers
GGUF
English
llama
text-generation-inference
unsloth
conversational
arxiv:
2504.11001
License:
llama3.2
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
52c2705
ReZero-v0.1-llama-3.2-3b-it-grpo-250404-gguf
/
unsloth.Q4_K_M.gguf
Commit History
(Trained with Unsloth)
d386667
verified
thinhlpg
commited on
Apr 7