RLinf-Co: Reinforcement Learning-Based Sim-Real Co-Training for VLA Models - Explained Simply | ArXiv Explained