Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play • Paper • arXiv:2509.25541 • Published Sep 29 • 138
Cache-of-Thought: Master-Apprentice Framework for Cost-Effective Vision Language Model Inference • Paper • arXiv:2502.20587 • Published Feb 27 • 1
Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling? • Paper • arXiv:2506.17417 • Published Jun 20 • 12
VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use • Paper • arXiv:2505.19255 • Published May 25 • 5
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance • Paper • arXiv:2506.06444 • Published Jun 6 • 73