Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs - Explained Simply | ArXiv Explained