Dispersion loss counteracts embedding condensation in small language modelsRedditJul 04, 2026 00:02Sharechenliu-1996.github.iochenliu-1996.github.io