mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-140 8B • Updated 18 days ago • 13
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-190 8B • Updated 18 days ago • 13
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-220 8B • Updated 18 days ago • 14
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-250 8B • Updated 18 days ago • 11
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-180 8B • Updated 18 days ago • 12
mlfoundations-cua-dev/qwen3_vl_30b_grpo-stage-1-on-103k-filtered-data-dynamic-sampling-partial-data Updated 18 days ago
mlfoundations-cua-dev/qwen2_5vl_7b_110k_plus_agentnet_clicks_lr_1_0e-06_z3_4nodes 8B • Updated 20 days ago • 12
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-120 8B • Updated 24 days ago • 13
mlfoundations-cua-dev/63k-nores-jedi-5k-refusal-train-subsample Viewer • Updated 9 days ago • 1k • 14
mlfoundations-cua-dev/qwen3-resize-easyr1-110k-bbox0p05-remove-pixmo-uground-seeclick-normalized Viewer • Updated 18 days ago • 101k • 137