Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors • Paper 2212.04248 • Published Dec 7, 2022
ConsistEdit: Highly Consistent and Precise Training-free Visual Editing • Paper 2510.17803 • Published Oct 20
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence • Paper 2509.12203 • Published Sep 15
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer • Paper 2508.09131 • Published Aug 12