AI & ML interests
AI4Science
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v39__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v40__steps_10000__bs_56__lr_5e7__seed_42 • Text Generation • 3B
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v41__steps_10000__bs_56__lr_5e7__seed_42 • Text Generation • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v37__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v36__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v38__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v33__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v34__steps_10000__bs_56__lr_5e7__seed_42 • 8B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v35__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v32__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 3
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v25__steps_10000__bs_56__lr_5e7__seed_42 • 8B
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v24__steps_10000__bs_56__lr_5e7__seed_42 • 3B
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v30__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 4
khuang2/qwen-2.5-3b-r1-countdown_v30__steps_10000__bs_224__lr_5e7__seed_42
khuang2/qwen-2.5-3b-r1-countdown_v31__steps_10000__bs_224__lr_5e7__seed_42 • 8B • 2
khuang2/qwen-2.5-3b-r1-countdown_v29__steps_10000__bs_224__lr_5e7__seed_42 • 2B
khuang2/qwen-2.5-3b-r1-countdown_v26__steps_10000__bs_224__lr_5e7__seed_42 • 2B
khuang2/qwen-2.5-3b-r1-countdown_v28__steps_10000__bs_224__lr_5e7__seed_42 • 8B
khuang2/qwen-2.5-3b-r1-countdown_v27__steps_10000__bs_224__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v20__steps_10000__bs_56__lr_5e7__seed_42 • 8B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v18__steps_10000__bs_56__lr_5e7__seed_42 • 3B
khuang2/qwen-2.5-3b-r1-countdown_v19__steps_10000__bs_224__lr_5e7__seed_42 • 2B
khuang2/qwen-2.5-3b-r1-countdown_v21__steps_10000__bs_224__lr_5e7__seed_42 • 2B • 61
khuang2/qwen-2.5-3b-r1-countdown_v20__steps_10000__bs_224__lr_5e7__seed_42 • 8B • 3
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v23__steps_10000__bs_56__lr_5e7__seed_42 • 2B
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v22__steps_10000__bs_56__lr_5e7__seed_42 • 2B
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v21__steps_10000__bs_56__lr_5e7__seed_42 • 2B
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v15__steps_10000__bs_56__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown_v15__steps_10000__bs_224__lr_5e7__seed_42 • 3B • 1
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v19__steps_10000__bs_56__lr_5e7__seed_42