Yue Zhao

10 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×9AI×8NLP×1Society×1ML×1Software Eng.×1

Frequent co-authors

Zhengyue Zhao2×

Chaowei Xiao2×

Ojas Nimase2×

Zhe Chen2×

Gengpei Qi2×

Xiyang Hu2×

Research Timeline

2026

The Autonomy Tax: Defense Training Breaks LLM Agents

Defense training for LLM agents, intended to improve safety, systematically degrades their core competence, leading to unreliability in multi-step tasks.

CNT: Safety-oriented Function Reuse across LLMs via Cross-Model Neuron Transfer

The paper introduces Cross-Model Neuron Transfer (CNT), a post-hoc method that efficiently transfers safety-oriented functionalities between different large language models by transferring minimal subsets of neurons, achieving high performance with minimal degradation.

Agent Audit: A Security Analysis System for LLM Agent Applications

Agent Audit is a novel security analysis system that comprehensively audits LLM agent applications by examining the entire software stack—including tool code, configuration, and prompts—to detect a wide range of vulnerabilities.

Infrastructure for Valuable, Tradable, and Verifiable Agent Memory

The paper proposes an infrastructure, clawgang and meowtrade, to transform private, non-transferable agent memories into verifiable, tradable economic commodities.

No Attacker Needed: Unintentional Cross-User Contamination in Shared-State LLM Agents

This paper identifies and analyzes unintentional cross-user contamination (UCC), a failure mode where benign, scope-bound artifacts degrade the outcomes of different users in shared-state LLM agents, requiring artifact-level defenses.

LPG: Balancing Efficiency and Policy Reasoning in Latent Policy Guardrails

The paper introduces Latent Policy Guardrail (LPG), a novel framework that efficiently enforces dynamic safety policies for LLMs by compressing complex policy deliberation into a small set of latent tokens, achieving high accuracy with significantly reduced latency.

GEO-Bench: Benchmarking Ranking Manipulation in Generative Engine Optimization

GEO-Bench introduces a standardized benchmark to compare various ranking manipulation attacks (both black-box and white-box) on generative engines, demonstrating that black-box content rewriting can be highly effective and stealthy.

OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents

The paper introduces OR-Space, a novel full-lifecycle workspace benchmark designed to rigorously evaluate industrial optimization agents by simulating real-world, multi-stage OR workflows that go beyond simple model translation.

GEO-Bench: Benchmarking Ranking Manipulation in Generative Engine Optimization

The paper introduces GEO-Bench, a unified benchmark that standardizes the evaluation of various generative engine optimization (GEO) ranking manipulation attacks, demonstrating that black-box content rewriting can be highly effective and stealthy compared to gradient-based methods.

MaskForge: Structure-Aware Adaptive Attacks for Jailbreaking Diffusion Large Language Models

MaskForge is a novel, adaptive, black-box attack framework that significantly improves jailbreaking diffusion large language models (dLLMs) by treating red-teaming as an optimized search over reusable structural patterns.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIRecentJun 1, 2026

MaskForge: Structure-Aware Adaptive Attacks for Jailbreaking Diffusion Large Language Models

Yingzi Ma, Zhengyue Zhao, Xiaogeng Liu, Minhui Xue +2 more

View →

cs.CRcs.AIRecentMay 27, 2026