Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding - Explained Simply | ArXiv Explained