Video-As-Prompt: Unified Semantic Control for Video Generation Paper • 2510.20888 • Published Oct 23 • 44
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model Paper • 2510.19871 • Published Oct 22 • 29
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published Oct 16 • 47
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation Paper • 2503.09151 • Published Mar 12 • 32
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published May 24 • 48
Align Your Tangent: Training Better Consistency Models via Manifold-Aligned Tangents Paper • 2510.00658 • Published Oct 1 • 3
OneFlow: Concurrent Mixed-Modal and Interleaved Generation with Edit Flows Paper • 2510.03506 • Published Oct 3 • 14
Aligning Text to Image in Diffusion Models is Easier Than You Think Paper • 2503.08250 • Published Mar 11 • 2