Chaochao Lu

5 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×5NLP×4AI×2Vision×2ML×2

Frequent co-authors

Yu Li4×

Tianhang Zheng3×

Yuenan Hou2×

Yingmei Wei2×

Yanming Guo2×

Dongrui Liu2×

Research Timeline

2026

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

The paper introduces AgentDoG 1.5, a lightweight and scalable alignment framework that significantly improves AI agent safety and security for complex, open-world agentic scenarios.

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

The paper introduces AgentDoG 1.5, a lightweight and scalable alignment framework that significantly improves AI agent safety and security for complex open-world agent deployments.

EvoDefense: Co-Evolving Black-Box Defense with Large Language Models

EvoDefense introduces an experience-guided, co-evolving black-box defense mechanism that significantly improves LLM robustness against unseen and diverse attacks without requiring model retraining.

EvoDefense: Co-Evolving Black-Box Defense with Large Language Models

EvoDefense introduces an experience-guided, co-evolving black-box defense mechanism that significantly improves the robustness of LLMs against unseen and diverse attacks without requiring model retraining.

TRACE: Task-Aware Adaptive Self-Evolving Agentic Jailbreaking

The paper proposes TRACE, a novel agentic jailbreaking framework that successfully bypasses safety mechanisms of advanced LLM agents by decomposing malicious tasks and disguising harmful subtasks within task-aware, iteratively evolved scenarios.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.CLRecentMay 29, 2026

EvoDefense: Co-Evolving Black-Box Defense with Large Language Models

Yu Li, Yuenan Hou, Yingmei Wei, Yanming Guo +1 more

EvoDefense introduces an experience-guided, co-evolving black-box defense mechanism that significantly improves LLM robustness against unseen and diverse attacks without requiring model retraining.

View →

cs.CRcs.CLRecentMay 29, 2026