Shaping capabilities with token-level data filtering - Explained Simply | ArXiv Explained