MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models - Explained Simply | ArXiv Explained