GRPO RL model
SunJack
SunJack
·
AI & ML interests
None yet
Organizations
models
14
SunJack/Qwen2.5-3B-R1-GGUF
3B
•
Updated
•
15
SunJack/Qwen2.5-3B-R1
Updated
•
4
SunJack/Phi-4-R1
Updated
SunJack/Phi-4-R1-GGUF
Updated
SunJack/Qwen2.5-7b-sft
Updated
•
3
SunJack/phi4-o1
15B
•
Updated
•
35
SunJack/Qwen2.5-3B-GRPO_lora
Updated
SunJack/qwen2.5-7b-o1
8B
•
Updated
•
15
•
1
SunJack/qwen2.5-7b-cve
8B
•
Updated
•
23
•
1
SunJack/qwen2-7b-ruozhiba-finetuning
8B
•
Updated
•
40
•
2