Jakelolipopp
/

Llama-3.2-3B-Instruct-t-GRPO-v0.1-merge

Text Generation

text-generation-inference

Model card Files Files and versions

Llama-3.2-3B-Instruct-t-GRPO-v0.1-merge

6.44 GB

1 contributor

History: 4 commits

Jakelolipopp's picture

Trained with Unsloth

3766831 verified 6 months ago