xxh
shreddedpork
AI & ML interests
Diffusion Model, MLLM
Recent Activity
authored a paper about 4 hours ago
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
authored a paper about 4 hours ago
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
authored a paper about 4 hours ago
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Organizations
None yet