arxiv:2506.03569
Li Shicheng
lscpku
AI & ML interests
None yet
Recent Activity
upvoted a paper 15 days ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
liked a model 3 months ago
XiaomiMiMo/MiMo-VL-7B-SFT-2508
liked a model 3 months ago
XiaomiMiMo/MiMo-VL-7B-RL-2508