Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation - Explained Simply | ArXiv Explained