LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 12 days ago • 70
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 19 days ago • 111
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper • 2512.00425 • Published 28 days ago • 49
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 25 days ago • 50
How Far Are We from Genuinely Useful Deep Research Agents? Paper • 2512.01948 • Published 26 days ago • 53
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published 26 days ago • 69
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published about 1 month ago • 80
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published about 1 month ago • 109
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published 24 days ago • 149
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25 • 166
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 23 days ago • 168
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 30 days ago • 214
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 25 days ago • 236
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 22 days ago • 38
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper • 2512.07831 • Published 19 days ago • 16