Pruning and Distilling Mixture-of-Experts into Dense Language Models | ArxivCSExplorer