Ruihua Song
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories
Sound ×1, AI ×1, Multimedia ×1
Research Timeline
2026
Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts
The paper introduces PlanAudio, a unified LLM-based framework that directly synthesizes natural, composite audio containing speech and sounds from unconstrained free-form text prompts, outperforming existing methods.
Papers
cs.SD, cs.AI, cs.MM | Recent | May 27, 2026
Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts
Yuyue Wang, Xihua Wang, Xin Cheng, Yijing Chen +1 more