Hao Yin

4 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×3Crypto×3

Frequent co-authors

Deyue Zhang2×

Yibing Liu1×

Research Timeline

2026

AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualization

AgentVisor is a novel defense framework that uses semantic virtualization, inspired by OS principles, to significantly reduce LLM agent vulnerability to prompt injection while maintaining high utility.

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety

SafeHarbor is a novel, hierarchical memory-augmented framework that establishes context-aware decision boundaries for LLM agents, achieving state-of-the-art safety while minimizing over-refusal.

DMN: A Compositional Framework for Jailbreaking Multimodal LLMs with Multi-Image Inputs

The paper proposes DMN, a compositional jailbreak framework that utilizes distributed instructions, multimodal evidence, and a number chain task across multiple images to significantly enhance the attack success rate against multimodal LLMs.

OpenClawBench: Benchmarking Process-side Anomalies in Real-world Agent Execution Trajectories

The paper introduces OpenClawBench, a large-scale dataset and framework for measuring process-side anomalies in real-world agent execution trajectories, demonstrating that task success does not guarantee operational reliability.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 28, 2026

OpenClawBench: Benchmarking Process-side Anomalies in Real-world Agent Execution Trajectories

Yibing Liu, Yangze Liu, Xiaolong Yin, Bin Wang +3 more

View →

cs.CRcs.AIRecentMay 18, 2026