Kailong Wang

5 indexed papers


Top categories

Crypto ×5, AI ×2

Frequent co-authors

Yuxi Li (2×)

Zhibo Zhang (2×)

Haoyu Wang (2×)

Tianlong Yu (2×)

Xingshuo Han (1×)

Ling Shi (1×)

Research Timeline

2026

When Safe Models Merge into Danger: Exploiting Latent Vulnerabilities in LLM Fusion

The paper introduces TrojanMerge, a framework demonstrating that model merging can be exploited to systematically compromise the safety alignment of multiple individually safe LLMs.

RefineRAG: Word-Level Poisoning Attacks via Retriever-Guided Text Refinement

RefineRAG introduces a novel word-level poisoning framework that significantly enhances knowledge poisoning attacks against RAG systems, achieving state-of-the-art effectiveness and transferability to black-box environments.

MATRIX: Multi-Layer Code Watermarking via Dual-Channel Constrained Parity-Check Encoding

MATRIX is a novel, robust code watermarking framework that encodes watermarks using constrained parity-check matrix equations, achieving high detection accuracy and improved robustness for code provenance tracking.

UNSEEN: A Cross-Stack LLM Unlearning Defense against AR-LLM Social Engineering Attacks

The paper proposes UNSEEN, a cross-stack defense system combining AR access control, LLM unlearning, and agent guardrails to mitigate sophisticated AR-LLM social engineering attacks.

Defense Against LLM Backdoors using Critical Neuron Isolation Pruning

The paper introduces DeCNIP, a method for identifying and neutralizing backdoors in large language models using representational analysis and neuron isolation pruning.


Papers

cs.CR · cs.AI · Empirical · Recent · Jul 22, 2026

Defense Against LLM Backdoors using Critical Neuron Isolation Pruning

Yuxi Li, Zhibo Zhang, Kailong Wang, Xingshuo Han +2 more

The paper introduces DeCNIP, a method for identifying and neutralizing backdoors in large language models using representational analysis and neuron isolation pruning.


cs.CR · cs.AI · Recent · Apr 25, 2026