OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM - Explained Simply | ArXiv Explained