5456es
/

implicit_reward_Qwen2.5-0.5B-Instruct_prune_0.5-sigmoid

preference-learning

Model card Files Files and versions

implicit_reward_Qwen2.5-0.5B-Instruct_prune_0.5-sigmoid / trainer_state.json

5456es's picture

Upload trainer_state.json with huggingface_hub

60d8fd1 verified 2 months ago

history contribute delete

391 kB

File too large to display, you can check the raw version instead.