Jiang Liu

4 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×3NLP×2ML×2Multiagent×1Robotics×1Crypto×1Sound×1

Frequent co-authors

Jacky Kwok1×

Shulu Li1×

Research Timeline

2026

Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors

The paper introduces DiffErase, a black-box attack that effectively removes inaudible audio watermarks while preserving perceptual quality by utilizing diffusion models.

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning

The paper proposes Predictive Routing Replay (PR2) to stabilize reinforcement learning on Mixture of Experts (MoE) LLMs by predicting and incorporating short-horizon router evolution during training and rollout.

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

The paper introduces Lookahead Group Reward (&) to combat Supervision Fidelity Decay (SFD) in on-policy distillation, significantly improving student model performance on long reasoning tasks.

LLM-as-a-Verifier: A General-Purpose Verification Framework

This paper introduces LLM-as-a-Verifier, a framework for fine-grained verification of LLMs using continuous scores, achieving state-of-the-art performance on various benchmarks.

Highlighted terms show continued research focus across papers

Papers

cs.AIcs.CLcs.LGEmpiricalRecentJul 6, 2026

LLM-as-a-Verifier: A General-Purpose Verification Framework

Jacky Kwok, Shulu Li, Pranav Atreya, Yuejiang Liu +5 more

This paper introduces LLM-as-a-Verifier, a framework for fine-grained verification of LLMs using continuous scores, achieving state-of-the-art performance on various benchmarks.

View →

cs.LGcs.AIRecent