Renjie
RogerLos
0 followers · 1 following
AI & ML interests
LLM
Recent Activity
upvoted a paper about 12 hours ago: The Principles of Diffusion Models
updated a collection about 18 hours ago: Long_CoT_Degradation_SFT
updated a model about 18 hours ago: RogerLos/curriculum_16k_long-cot_Qwen2.5-0.5B-Instruct
Organizations
None yet
RogerLos's models: 366
Sort: Recently updated
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_70 • 2B • Updated 1 day ago • 15
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_60 • 2B • Updated 1 day ago • 10
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_50 • 2B • Updated 1 day ago • 7
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_40 • 2B • Updated 1 day ago • 8
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_30 • 2B • Updated 1 day ago • 9
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_20 • 2B • Updated 1 day ago • 13
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_110 • 2B • Updated 1 day ago • 15
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_100 • 2B • Updated 1 day ago • 8
RogerLos/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_10 • 2B • Updated 1 day ago • 10
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_90 • 0.6B • Updated 1 day ago • 11
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_80 • 0.6B • Updated 1 day ago • 14
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_70 • 0.6B • Updated 1 day ago • 15
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_30 • 0.6B • Updated 1 day ago • 13
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_110 • 0.6B • Updated 1 day ago • 9
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_100 • 0.6B • Updated 1 day ago • 11
RogerLos/verl-grpo-8k-Qwen2.5-0.5B-Instruct-global_step_10 • 0.6B • Updated 1 day ago • 12
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_90 • 8B • Updated 1 day ago • 6
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_80 • 8B • Updated 1 day ago • 12
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_70 • 8B • Updated 1 day ago • 6
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_60 • 8B • Updated 1 day ago • 11
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_50 • 8B • Updated 1 day ago • 12
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_40 • 8B • Updated 1 day ago • 6
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_30 • 8B • Updated 1 day ago • 9
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_20 • 8B • Updated 1 day ago • 9
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_110 • 8B • Updated 1 day ago • 12
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_100 • 8B • Updated 1 day ago • 11
RogerLos/verl-grpo-128k-Qwen2.5-7B-Instruct-global_step_10 • 8B • Updated 1 day ago • 11
RogerLos/verl-grpo-128k-Qwen2.5-3B-Instruct-global_step_90 • 3B • Updated 1 day ago • 11
RogerLos/verl-grpo-128k-Qwen2.5-3B-Instruct-global_step_80 • 3B • Updated 1 day ago • 11
RogerLos/verl-grpo-128k-Qwen2.5-3B-Instruct-global_step_70 • 3B • Updated 1 day ago • 15