Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Sheng Di

Sheng Di

3 indexed papers

Recent (6 mo)
3
With code
0
Influential cites
0
Benchmarked
0

Publications per year

3
26

Top categories

AI×3ML×2Distributed×1Optimization and Control×1Stats ML×1

Frequent co-authors

Azal Ahmad Khan1×
Ammar Ahmed1×
Zeshan Fayyaz1×
Mingyi Hong1×
Ali Anwar1×
Yafan Huang1×

Research Timeline

2026
Unlearning in Diffusion Models: A Unified Framework with KL Divergence and Likelihood Constraints

The paper proposes a unified, constrained optimization framework using KL divergence and likelihood constraints to achieve effective and principled unlearning in diffusion models.

Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

The paper introduces Straggler-Aware Group Control (SAGC), a dynamic group-size controller that optimizes synchronous on-policy RL training by adapting group size to minimize delays caused by slow rollouts (stragglers), thereby improving wall-clock efficiency and model performance.

Not All Errors Are Equal: A Systematic Study of Error Propagation in Large Language Model Inference

This paper systematically studies how soft errors propagate during Large Language Model (LLM) inference using a novel fault-injection framework, providing critical insights and mitigation strategies for improving LLM reliability.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIRecentJun 1, 2026

Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

Azal Ahmad Khan, Ammar Ahmed, Zeshan Fayyaz, Sheng Di +2 more

The paper introduces Straggler-Aware Group Control (SAGC), a dynamic group-size controller that optimizes synchronous on-policy RL training by adapting group size to minimize delays caused by slow rol…

View →
cs.DCcs.AIRecentJun 1, 2026

Not All Errors Are Equal: A Systematic Study of Error Propagation in Large Language Model Inference

Yafan Huang, Sheng Di, Guanpeng Li

This paper systematically studies how soft errors propagate during Large Language Model (LLM) inference using a novel fault-injection framework, providing critical insights and mitigation strategies f…

View →
cs.LGcs.AImath.OCRecentMay 29, 2026

Unlearning in Diffusion Models: A Unified Framework with KL Divergence and Likelihood Constraints

Shervin Khalafi, Alejandro Ribeiro, Dongsheng Ding

The paper proposes a unified, constrained optimization framework using KL divergence and likelihood constraints to achieve effective and principled unlearning in diffusion models.

View →