LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization - Explained Simply | ArXiv Explained