Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning Paper • 2505.17005 • Published May 22
LLMBox: A Comprehensive Library for Large Language Models Paper • 2407.05563 • Published Jul 8, 2024
Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search Paper • 2411.11694 • Published Nov 18, 2024