Xirui Li's picture

Xirui Li PRO

AIcell

·

https://xirui-li.github.io/

AI & ML interests

Foundation LLM and VLM

Recent Activity

upvoted a paper 3 days ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

updated a model 3 days ago

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority

published a model 5 days ago

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority

View all activity

Organizations

AIcell 's models 26

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority

2B • Updated 3 days ago • 24

AIcell/Qwen-1.5B-Instruct-GRPO-Majority

2B • Updated 6 days ago • 8

AIcell/Qwen-1.5B-Instruct-GRPO-Random

2B • Updated 7 days ago • 10

AIcell/Qwen-1.5B-Instruct-GRPO

2B • Updated 7 days ago • 21

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-non-reasoning

2B • Updated 13 days ago • 27

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-opposite

2B • Updated 14 days ago • 8

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-random

2B • Updated 16 days ago • 11

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

2B • Updated 21 days ago • 43

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k

Text Generation • 2B • Updated 25 days ago • 31

AIcell/Qwen2.5-0.5B-Instruct-GRPO-gsm8k

Text Generation • 0.5B • Updated 25 days ago • 54

AIcell/Qwen2.5-3B-Instruct-GRPO-gsm8k

Updated 27 days ago

AIcell/Qwen2.5-1.5B-Instruct-GRPO-DAPO17k-thinking

2B • Updated Oct 6 • 5

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math220k-thinking

Text Generation • 2B • Updated Oct 5 • 3

AIcell/Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k

Text Generation • 2B • Updated Sep 29 • 4

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-random-reward

Text Generation • 2B • Updated Sep 26 • 3

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-no-thinking

2B • Updated Sep 26

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-monitor

Text Generation • 2B • Updated Sep 12 • 2

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-plain

Text Generation • 2B • Updated Sep 12 • 9

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-GPQA-Diamond-thinking

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-MATH-500-thinking

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-thinking

AIcell/Qwen2.5-1.5B-Base-GRPO-Math12k

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-no-thinkng

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k

AIcell/Qwen2.5-1.5B-Instruct-GRPO

AIcell/Qwen2.5-Math-1.5B-GRPO