Edit images based on user instructions
Generate detailed 3D models from images and coarse models
Text-to-Video