InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models Paper • 2509.22536 • Published Sep 26 • 2
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 26 days ago • 173
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization Paper • 2508.05731 • Published Aug 7 • 25