ferdinandjasong
/

SuperCoder-7B-Qwen2.5-0525-peft-grpo-v2-merged

Text Generation

text-generation-inference

Model card Files Files and versions

Uploaded finetuned model

Developed by: ferdinandjasong
License: apache-2.0
Finetuned from model : nvidia/OpenCodeReasoning-Nemotron-7B

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: 5

Safetensors

Model size

8B params

Tensor type

BF16

·

Model tree for ferdinandjasong/SuperCoder-7B-Qwen2.5-0525-peft-grpo-v2-merged

Base model

Qwen/Qwen2.5-7B

Finetuned

Qwen/Qwen2.5-7B-Instruct

Finetuned

nvidia/OpenCodeReasoning-Nemotron-7B

Finetuned

(1)

this model

Quantizations

1 model