Computer Vision and Deep Learning
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation