Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published 2 days ago • 60
ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems Paper • 2510.26475 • Published Oct 30
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published 10 days ago • 44
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published 10 days ago • 44
Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation Paper • 2506.04614 • Published Jun 5 • 19
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration Paper • 2505.21471 • Published May 27 • 5
Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization Paper • 2502.14496 • Published Feb 20
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 429