Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks Paper • 2511.15065 • Published 3 days ago • 69
InteractComp: Evaluating Search Agents With Ambiguous Queries Paper • 2510.24668 • Published 24 days ago • 96
A Survey of Data Agents: Emerging Paradigm or Overstated Hype? Paper • 2510.23587 • Published 25 days ago • 65
ReCode: Unify Plan and Action for Universal Granularity Control Paper • 2510.23564 • Published 25 days ago • 119
Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting Paper • 2505.19716 • Published May 26 • 4
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 300