Lu Yan

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×1Software Eng.×1Crypto×1

Frequent co-authors

Xuan Chen2×

Xiangyu Zhang2×

Ruqi Zhang1×

Research Timeline

2026

Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety

The paper introduces SafeAudit, a meta-audit framework that systematically enumerates test cases and uses a quantitative metric to uncover significant residual unsafe behaviors in LLM agents that existing benchmarks miss.

Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles

The paper introduces WIRE, a pipeline for diagnosing live intra-policy rule conflicts in LLM agents by identifying and testing specific rule pairs within a single prompt policy that can co-govern a realistic state.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 27, 2026

Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles

Lu Yan, Xuan Chen, Xiangyu Zhang

View →

cs.SEcs.CRRecentMar 18, 2026