CBF-RL: Safety Filtering Reinforcement Learning in Training with Control Barrier Functions - Explained Simply | ArXiv Explained