DocReward: A Document Reward Model for Structuring and Stylizing Paper • 2510.11391 • Published Oct 13 • 27
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation Paper • 2303.11089 • Published Mar 20, 2023 • 1
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Paper • 2412.14167 • Published Dec 18, 2024 • 1
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published Apr 11 • 42
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations Paper • 2505.18096 • Published May 23
Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Paper • 2507.07982 • Published Jul 10 • 33
Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Paper • 2507.07982 • Published Jul 10 • 33