UI-Genie Collection UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents • 3 items • Updated 1 day ago
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning Paper • 2510.14958 • Published 13 days ago • 22
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving Paper • 2510.12796 • Published 15 days ago • 11
WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning Paper • 2509.22644 • Published Sep 26 • 20
VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing Paper • 2509.22651 • Published Sep 26 • 22
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28 • 81
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28 • 81
DocMark Collection Models and Dataset for CVPR 2025 paper: Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding • 3 items • Updated Jun 16
DocMark Collection Models and Dataset for CVPR 2025 paper: Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding • 3 items • Updated Jun 16
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published Jun 11 • 48