Yufei He

4 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×4AI×3NLP×2ML×2

Frequent co-authors

Yulin Chen4×

Tri Cao4×

Bryan Hooi4×

Yuexin Li3×

Wenjie Qu2×

Linyu Wu2×

Research Timeline

2026

WebAgentGuard: A Reasoning-Driven Guard Model for Detecting Prompt Injection Attacks in Web Agents

The paper introduces WebAgentGuard, a novel reasoning-driven, multimodal guard model that effectively detects prompt injection attacks in vulnerable web agents without compromising their functionality.

WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections

The paper proposes WARD, a robust and efficient defense model that secures web agents against prompt injection attacks embedded in web content, achieving high recall and low false positives even against adaptive attacks.

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

AliMark proposes a novel watermarking framework that treats sentence-level watermarking as a bit sequence alignment problem, significantly enhancing robustness against structural text perturbations like sentence splitting and merging.

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

AliMark proposes a novel framework that enhances the robustness of sentence-level watermarking by reformulating the problem as a bit sequence encoding and alignment task, significantly improving resilience against structural text perturbations like sentence splitting and merging.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.CLRecentMay 28, 2026

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

Yuexin Li, Wenjie Qu, Linyu Wu, Yulin Chen +4 more

View →

cs.CRcs.AIcs.CLRecentMay 28, 2026