EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing Paper • 2512.06065 • Published 4 days ago • 18
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training Paper • 2509.26625 • Published Sep 30 • 43
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Paper • 2504.07961 • Published Apr 10 • 5