From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models - Explained Simply | ArXiv Explained