Mingyi Wang

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

ML×2NLP×1AI×1

Frequent co-authors

Yuhang Zhou1×

Lizhu Zhang1×

Yifan Wu1×

Peng Bo1×

Jiayi Liu1×

Xiangjun Fan1×

Research Timeline

2026

Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization

The paper proposes Detector Evasion Policy Optimization (DEPO), a constrained reinforcement learning method that effectively evades AI text detectors while strictly maintaining the original text's semantics.

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

OmniOPD introduces a logit-free, chunk-level distillation framework that improves on standard On-Policy Distillation by using semantic similarity and peak-entropy scheduling, achieving state-of-the-art performance even with black-box teachers.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.CLRecentMay 31, 2026

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Yuhang Zhou, Lizhu Zhang, Yifan Wu, Mingyi Wang +4 more

View →

cs.LGcs.AIRecentMay 29, 2026