Meng Sun

5 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Vision×3AI×3NLP×2Crypto×2HCI×1Info Retrieval×1ML×1

Frequent co-authors

Yihao Zhang2×

Shenghui Chen1×

Po-han Li1×

Ximeng Sun1×

Shijia Yang1×

Emad Barsoum1×

Research Timeline

2026

The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems

The paper introduces Salami Slicing Risk, a novel multi-turn jailbreak technique that accumulates harmful intent through numerous low-risk inputs, achieving state-of-the-art attack success rates against major LLMs.

VOW: Verifiable and Oblivious Watermark Detection for Large Language Models

VOW introduces a novel, privacy-preserving, and cryptographically verifiable protocol for detecting watermarks in LLM-generated text, overcoming the limitations of centralized and non-verifiable existing methods.

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

The paper introduces Harness-1, a search agent that separates semantic decision-making from state management by using a stateful search harness, achieving state-of-the-art performance across diverse retrieval benchmarks.

MonoIR-RS: Infrared Remote Sensing Vision-Language Learning with CLIP and VLM Adaptation

This paper introduces MonoIR-RS, a large-scale infrared remote-sensing vision-language dataset and benchmark for understanding infrared imagery.

VEGAS: Human-Aligned Video Caption Evaluation via Gaze

The paper proposes VEGAS, a metric for video captioning that uses test-time gaze to sample personalized captions aligning with human focus.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIcs.HCEmpiricalRecentJul 9, 2026

VEGAS: Human-Aligned Video Caption Evaluation via Gaze

Shenghui Chen, Po-han Li, Ximeng Sun, Shijia Yang +4 more

The paper proposes VEGAS, a metric for video captioning that uses test-time gaze to sample personalized captions aligning with human focus.

View →

cs.CVDatasetRecent