Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MultiRL
non-profit
Activity Feed
Follow
3
AI & ML interests
None defined yet.
Recent Activity
iruno
updated
a dataset
about 8 hours ago
MultiRL/new_sudoku_many
iruno
published
a dataset
about 8 hours ago
MultiRL/new_sudoku_many
KimSHine
updated
a model
about 9 hours ago
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_geo_ms_6epoch
View all activity
Team members
3
models
93
Sort: Recently updated
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_geo_ms_6epoch
2B
•
Updated
about 9 hours ago
MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_5
2B
•
Updated
about 15 hours ago
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_geo_ms_token_tis
2B
•
Updated
1 day ago
•
183
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_gem_ms_seq_is
2B
•
Updated
3 days ago
•
273
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_mask_only
2B
•
Updated
3 days ago
•
246
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_995_98_ori_norm
2B
•
Updated
8 days ago
•
156
MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_995_98
2B
•
Updated
9 days ago
•
6
MultiRL/qwen3_1.7b_sft_final_easy_reinforce_ours_adv_fixed_gamma_0.9
2B
•
Updated
11 days ago
•
382
MultiRL/qwen3_1.7b_easy_rl_old_adv_fixed_gamma_1
2B
•
Updated
13 days ago
•
230
MultiRL/qwen3_1.7b_easy_rl_fixed_gamma_1
2B
•
Updated
15 days ago
•
135
View 93 models
datasets
31
Sort: Recently updated
MultiRL/new_sudoku_many
Viewer
•
Updated
about 8 hours ago
•
790
MultiRL/hard_short
Viewer
•
Updated
about 11 hours ago
•
100
MultiRL/easy_tooshort
Viewer
•
Updated
about 13 hours ago
•
420
MultiRL/easy_toolong
Viewer
•
Updated
about 14 hours ago
•
150
MultiRL/final_sudoku_medium_rl_hint
Viewer
•
Updated
about 15 hours ago
•
640
MultiRL/final_sudoku_sft_C_hint
Viewer
•
Updated
about 15 hours ago
•
800
MultiRL/final_sudoku_easy_rl_hint
Viewer
•
Updated
about 15 hours ago
•
320
MultiRL/final_sudoku_benchmark_hint
Viewer
•
Updated
1 day ago
•
515
•
12
MultiRL/rush_hour_benchmark
Viewer
•
Updated
8 days ago
•
150
•
31
MultiRL/rush_hour_hard_rl
Viewer
•
Updated
8 days ago
•
640
•
27
View 31 datasets