Yuhan123/olmo-cad-rm-cad-maj-vote-eval-acc-0-9065-cad-rm-cad-maj-vote-eval-acc-0-9065-1-steps-20000 Text Generation • 1B • Updated 23 days ago • 13
Yuhan123/olmo-cad-checkpoint-460-cad-rm-cad-labels-0-eval-acc-0-8385-checkpoint-460-1-steps-20000 Text Generation • 1B • Updated 23 days ago • 14
Yuhan123/olmo-cad-checkpoint-360-cad-rm-cad-labels-1-eval-acc-0-8354-checkpoint-360-1-steps-20000 Text Generation • 1B • Updated 23 days ago • 20
Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-preschool-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 7
Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-7th-grade-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 6
Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-12th-grade-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 5
Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-7th-grade-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 6
Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-12th-grade-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 6
Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-7th-grade-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 5
Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-12th-grade-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 5
Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-gradschool-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 5
Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-preschool-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 7
Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-preschool-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 6
Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-gradschool-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 8
Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-gradschool-1-steps-1000 Text Generation • 1B • Updated Jul 17 • 6
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.316 Text Generation • 3B • Updated May 27 • 6
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.229 Text Generation • 3B • Updated May 27 • 6
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.340 Text Generation • 3B • Updated May 27 • 5
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.309 Text Generation • 3B • Updated May 27 • 6
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.361 Text Generation • 3B • Updated May 27 • 6
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.383 Text Generation • 3B • Updated May 27 • 6
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.398 Text Generation • 3B • Updated May 27 • 4