Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm - Explained Simply | ArXiv Explained