Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published 29 days ago • 93 • 9
UniVideo: Unified Understanding, Generation, and Editing for Videos Paper • 2510.08377 • Published 23 days ago • 68
Self-Improvement in Multimodal Large Language Models: A Survey Paper • 2510.02665 • Published 30 days ago • 19 • 6
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26 • 179
Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts Paper • 2506.10357 • Published Jun 12 • 21