Ruihua Song

1 indexed paper


Top categories

Sound ×1, AI ×1, Multimedia ×1

Frequent co-authors

Yuyue Wang (1×)

Xihua Wang (1×)

Xin Cheng (1×)

Yijing Chen (1×)

Research Timeline

2026

Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts

The paper introduces PlanAudio, a unified LLM-based framework that directly synthesizes natural, composite audio containing speech and sounds from unconstrained free-form text prompts, outperforming existing methods.


Papers

cs.SD, cs.AI, cs.MM | Recent | May 27, 2026

Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts

Yuyue Wang, Xihua Wang, Xin Cheng, Yijing Chen +1 more
