Zhiyong Wu

1 indexed paper

Top categories

Audio and Speech Processing ×1, AI ×1, Sound ×1

Frequent co-authors

Zhisheng Zhang ×1

Xiang Li ×1

Yixuan Zhou ×1

Jing Peng ×1

Guoyang Zeng ×1

Research Timeline

2026

LoSATok: Low-dimensional Semantic-Acoustic Tokenizer for Cross-Domain Audio Understanding and Generation

LoSATok is a low-dimensional semantic-acoustic tokenizer that compresses high-dimensional audio features into a compact latent space, improving the performance and efficiency of audio generation models.

Papers

eess.AS cs.AI cs.SD Recent May 27, 2026

LoSATok: Low-dimensional Semantic-Acoustic Tokenizer for Cross-Domain Audio Understanding and Generation

Zhisheng Zhang, Xiang Li, Yixuan Zhou, Jing Peng +2 more
