Hugging Face
Xirui Li
PRO
AIcell
4 followers · 13 following
https://xirui-li.github.io/
xiruili7_li
xirui-li
AI & ML interests
Foundation LLM and VLM
Recent Activity
Upvoted a paper 3 days ago: ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Updated a model 3 days ago: AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority
Published a model 5 days ago: AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority
Organizations
AIcell's models (26)
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority • 2B • Updated 3 days ago • 24
AIcell/Qwen-1.5B-Instruct-GRPO-Majority • 2B • Updated 6 days ago • 8
AIcell/Qwen-1.5B-Instruct-GRPO-Random • 2B • Updated 7 days ago • 10
AIcell/Qwen-1.5B-Instruct-GRPO • 2B • Updated 7 days ago • 21
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-non-reasoning • 2B • Updated 13 days ago • 27
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-opposite • 2B • Updated 14 days ago • 8
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-random • 2B • Updated 16 days ago • 11
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated 21 days ago • 43
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k • Text Generation • 2B • Updated 25 days ago • 31
AIcell/Qwen2.5-0.5B-Instruct-GRPO-gsm8k • Text Generation • 0.5B • Updated 25 days ago • 54
AIcell/Qwen2.5-3B-Instruct-GRPO-gsm8k • Updated 27 days ago
AIcell/Qwen2.5-1.5B-Instruct-GRPO-DAPO17k-thinking • 2B • Updated Oct 6 • 5
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math220k-thinking • Text Generation • 2B • Updated Oct 5 • 3
AIcell/Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k • Text Generation • 2B • Updated Sep 29 • 4
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-random-reward • Text Generation • 2B • Updated Sep 26 • 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-no-thinking • 2B • Updated Sep 26
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-monitor • Text Generation • 2B • Updated Sep 12 • 2
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-plain • Text Generation • 2B • Updated Sep 12 • 9
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-GPQA-Diamond-thinking • Updated Aug 21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-MATH-500-thinking • Updated Aug 21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-thinking • Updated Aug 20
AIcell/Qwen2.5-1.5B-Base-GRPO-Math12k • Updated Jul 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-no-thinkng • Updated Jul 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k • Updated Jul 1
AIcell/Qwen2.5-1.5B-Instruct-GRPO • Updated Jul 1
AIcell/Qwen2.5-Math-1.5B-GRPO • Updated Jun 1