Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models - Explained Simply | ArXiv Explained