BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation Paper • 2509.25077 • Published Sep 2025 • 14
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling Paper • 2509.12201 • Published Sep 15, 2025 • 103
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models Paper • 2412.13188 • Published Dec 17, 2024
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Paper • 2507.11540 • Published Jul 15, 2025 • 4
WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool Paper • 2509.05296 • Published Sep 5, 2025 • 7
Precise Action-to-Video Generation Through Visual Action Prompts Paper • 2508.13104 • Published Aug 18, 2025 • 11
Neural 3D Scene Reconstruction with the Manhattan-world Assumption Paper • 2205.02836 • Published May 5, 2022
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation Paper • 2503.14483 • Published Mar 18, 2025 • 1