Elena Mocanu

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

ML×2AI×2

Frequent co-authors

Boqian Wu2×

Qiao Xiao2×

Research Timeline

2026

Memory-Efficient LLM Training with Dynamic Sparsity: From Stability to Practical Scaling

The paper introduces Sparse Memory-Efficient Training (SMET), a method that stabilizes and optimizes Dynamic Sparse Training (DST) for large language models, enabling stable and memory-efficient sparse pre-training.

When Data Is Scarce: Scaling Sparse Language Models with Repeated Training

This paper introduces a new scaling law for sparse language models trained with limited data, demonstrating that sparsity can significantly improve performance and delay data saturation during multi-epoch training.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIRecentMay 31, 2026

When Data Is Scarce: Scaling Sparse Language Models with Repeated Training

Boqian Wu, Qiao Xiao, Patrik Okanovic, Tomasz Sternal +5 more

View →

cs.LGcs.AIRecentMay 30, 2026