Guowen Xu

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×2NLP×1

Frequent co-authors

Rui Zhang2×

Hongwei Li2×

Zihan Wang1×

Yu Liu1×

Chi Liu1×

Qingchuan Zhao1×

Research Timeline

2026

The Art of (Mis)alignment: How Fine-Tuning Methods Effectively Misalign and Realign LLMs in Post-Training

The paper investigates how various fine-tuning methods can be used both to intentionally misalign and subsequently realign large language models (LLMs), revealing distinct strengths for attack and defense mechanisms.

Black-Box Skill Stealing Attack from Proprietary LLM Agents: An Empirical Study

This paper presents the first systematic study of black-box skill stealing attacks against proprietary LLM agents, demonstrating that structured agent skills can be easily extracted, posing a significant and often overlooked copyright risk.

Highlighted terms show continued research focus across papers

Papers

cs.CRRecentApr 23, 2026

Black-Box Skill Stealing Attack from Proprietary LLM Agents: An Empirical Study

Zihan Wang, Rui Zhang, Yu Liu, Chi Liu +3 more

View →

cs.CRcs.CLRecentApr 9, 2026