Weiwei Qi

2 indexed papers

Top categories

Crypto ×2

Frequent co-authors

Tianhang Zheng (2×)

Zhan Qin (2×)

Kui Ren (2×)

Churui Zeng (1×)

Kedong Xiu (1×)

Chaochao Lu (1×)

Research Timeline

2026

Towards Identification and Intervention of Safety-Critical Parameters in Large Language Models

The paper proposes the Expected Safety Impact (ESI) framework to identify safety-critical parameters in LLMs, introducing targeted tuning methods (SET and SPA) to enhance safety and preserve alignment during model adaptation.

TRACE: Task-Aware Adaptive Self-Evolving Agentic Jailbreaking

The paper proposes TRACE, a novel agentic jailbreaking framework that successfully bypasses safety mechanisms of advanced LLM agents by decomposing malicious tasks and disguising harmful subtasks within task-aware, iteratively evolved scenarios.

Papers

cs.CR · Recent · May 29, 2026

TRACE: Task-Aware Adaptive Self-Evolving Agentic Jailbreaking

Churui Zeng, Weiwei Qi, Kedong Xiu, Tianhang Zheng +4 more

cs.CR · Recent · Apr 9, 2026