Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16 • 33 • 2
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System Paper • 2509.18091 • Published Sep 22 • 33 • 3
NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search Paper • 2505.14680 • Published May 20 • 9 • 2
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation Paper • 2503.22675 • Published Mar 28 • 36 • 2
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents Paper • 2503.08684 • Published Mar 11 • 5 • 2