Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective - Explained Simply | ArXiv Explained