ColPali: Efficient Document Retrieval with Vision Language Models - Explained Simply | ArXiv Explained