kevinshin
Add converted safetensors from FSDP checkpoint (data/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-4-beta-0.01-wc-cw-3k-neg-rethink-pos/checkpoint-460/pytorch_model_fsdp_0)
8776438 verified