SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models Paper • 2510.12784 • Published Oct 14 • 20
view article Article Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models Jul 10 • 53
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Paper • 2503.24388 • Published Mar 31 • 30