Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis - Explained Simply | ArXiv Explained