Recent Activity
Papers
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence
InternLM2 Reward Models
-
internlm/internlm2-math-plus-20b
Text Generation • 20B • Updated • 164 • 7 -
internlm/internlm2-math-plus-7b
Text Generation • 8B • Updated • 436 • 11 -
internlm/internlm2-math-plus-1_8b
Text Generation • 2B • Updated • 149 • 12 -
internlm/internlm2-math-plus-mixtral8x22b
Text Generation • 141B • Updated • 62 • 18
-
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 204 -
OpenGVLab/InternVL3_5-241B-A28B
Image-Text-to-Text • 241B • Updated • 5.55k • 131 -
OpenGVLab/InternVL3_5-38B
Image-Text-to-Text • 38B • Updated • 8.34k • 35 -
OpenGVLab/InternVL3_5-30B-A3B
Image-Text-to-Text • 31B • Updated • 40k • 38
-
internlm/OREAL-32B
Text Generation • 33B • Updated • 97 • 24 -
internlm/OREAL-7B
Text Generation • 8B • Updated • 84 • 20 -
internlm/OREAL-DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 68 • 9 -
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Paper • 2502.06781 • Published • 59
-
internlm/internlm-xcomposer2-4khd-7b
Visual Question Answering • Updated • 961 • 74 -
internlm/internlm-xcomposer2-vl-7b
Visual Question Answering • Updated • 1.56k • 83 -
internlm/internlm-xcomposer2-vl-1_8b
Visual Question Answering • Updated • 83 • 18 -
internlm/internlm-xcomposer2-7b
Text Generation • Updated • 1.94k • 31