Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Jiang Liu

Jiang Liu

3 indexed papers

Recent (6 mo)
3
With code
0
Influential cites
0
Benchmarked
0

Publications per year

3
26

Top categories

AI×2ML×1NLP×1Crypto×1Sound×1

Frequent co-authors

Daize Dong1×
Junlin Chen1×
Haolong Jia1×
Jiawei Wu1×
Huanwei Di1×
Jialian Wu1×

Research Timeline

2026
Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors

The paper introduces DiffErase, a black-box attack that effectively removes inaudible audio watermarks while preserving perceptual quality by utilizing diffusion models.

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning

The paper proposes Predictive Routing Replay (PR2) to stabilize reinforcement learning on Mixture of Experts (MoE) LLMs by predicting and incorporating short-horizon router evolution during training and rollout.

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

The paper introduces Lookahead Group Reward (&) to combat Supervision Fidelity Decay (SFD) in on-policy distillation, significantly improving student model performance on long reasoning tasks.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIRecentMay 29, 2026

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning

Daize Dong, Junlin Chen, Haolong Jia, Jiawei Wu +8 more

The paper proposes Predictive Routing Replay (PR2) to stabilize reinforcement learning on Mixture of Experts (MoE) LLMs by predicting and incorporating short-horizon router evolution during training a…

View →
cs.CLcs.AIRecentMay 29, 2026

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

Yanjiang Liu, Jie Lou, Xinyan Guan, Yuqiu Ji +6 more

The paper introduces Lookahead Group Reward (&) to combat Supervision Fidelity Decay (SFD) in on-policy distillation, significantly improving student model performance on long reasoning tasks.

View →
cs.CRcs.SDRecentMay 28, 2026

Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors

Lingfeng Yao, Xincong Zhong, Chenpei Huang, Xuandong Zhao +5 more

The paper introduces DiffErase, a black-box attack that effectively removes inaudible audio watermarks while preserving perceptual quality by utilizing diffusion models.

View →