Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift - Explained Simply | ArXiv Explained