5456es / selective_dpo_Llama-3.2-1B-Instruct_prune_0.5-sigmoid

preference-learning

selective_dpo_Llama-3.2-1B-Instruct_prune_0.5-sigmoid / trainer_state.json

Upload trainer_state.json with huggingface_hub

c0fd40b verified 2 months ago

390 kB