SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning - Explained Simply | ArXiv Explained