Nearly Optimal Active Preference Learning and Its Application to LLM Alignment - Explained Simply | ArXiv Explained