Qwen2.5-0.5B-Instruct-GRPO-gsm8k / trainer_state.json
AIcell's picture
Model save
a54da2d verified
File too large to display, you can check the raw version instead.