Efficient Video Prediction via Sparsely Conditioned Flow Matching Paper • 2211.14575 • Published Nov 26, 2022 • 1
Communication-Inspired Tokenization for Structured Image Representations Paper • 2602.20731 • Published Feb 24 • 4
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 22 days ago • 14
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control Paper • 2412.11198 • Published Dec 15, 2024 • 2
From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models Paper • 2506.07280 • Published Jun 10, 2025 • 1
Communication-Inspired Tokenization for Structured Image Representations Paper • 2602.20731 • Published Feb 24 • 4
Rethinking Visual Intelligence: Insights from Video Pretraining Paper • 2510.24448 • Published Oct 28, 2025 • 7