Siheng Xiong
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper proposes replacing expensive, always-on LLM calls for proactive agent triggering with a specialized Temporal-Graph-Learning (TGL) model, significantly improving efficiency and performance.
The paper introduces DSL-LLaDA, a method that lightly adapts a pre-trained masked diffusion language model to perform continuous denoising in embedding space, significantly improving text generation quality and robustness, especially under low step budgets.
Papers
DSL-LLaDA: Scaling Continuous Denoising to 8B Masked Diffusion LMs
Longxuan Yu, Yunshu Wu, Yu Fu, Siheng Xiong +4 more
The paper introduces DSL-LLaDA, a method that lightly adapts a pre-trained masked diffusion language model to perform continuous denoising in embedding space, significantly improving text generation q…