SARAH: Spatially Aware Real-time Agentic Humans - Explained Simply | ArXiv Explained