kevinshin's picture
Add converted safetensors from FSDP checkpoint (data/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-4-beta-0.5-wc-cw-3k-neg-rethink-pos/checkpoint-670/pytorch_model_fsdp_0)
da57ae6 verified