Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

JayHyeon
/
Qwen_1.5B-math-VDPO_5e-7_3.0vpo_constant-5ep

Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
Qwen_1.5B-math-VDPO_5e-7_3.0vpo_constant-5ep / runs
8.69 kB
  • 1 contributor
History: 1 commit
JayHyeon's picture
JayHyeon
Training in progress, step 185
7a514b1 verified 6 months ago
  • Jun13_03-16-06_01933a260f36
    Training in progress, step 185 6 months ago