Dispersion loss counteracts embedding condensation in small language modelsToday•chenliu-1996.github.io