RLRM

community

AI & ML interests

None defined yet.

Recent Activity

DongfuJiang authored a paper about 1 month ago

Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning

DongfuJiang authored a paper about 1 month ago

VideoScore2: Think before You Score in Generative Video Evaluation

DongfuJiang authored a paper 2 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

View all activity

RLRM 's models 1

RLRM/big_math_rl_pair_ct_7B

8B • Updated Mar 26