Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5456es
/
selective_dpo_Llama-3.2-1B-Instruct_prune_0.5-sigmoid
like
0
Safetensors
llama
dpo
preference-learning
selective
pruned
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
e22f476
selective_dpo_Llama-3.2-1B-Instruct_prune_0.5-sigmoid
/
trainer_state.json
5456es
Upload trainer_state.json with huggingface_hub
c0fd40b
verified
2 months ago
raw
Copy download link
history
Safe
390 kB
File too large to display, you can
check the raw version
instead.