Liang Li

10 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×7Crypto×7NLP×3Databases×1Systems and Control×1ML×1

Frequent co-authors

Liang Lin2×

Zhiqing Ma1×

Zhonghao Xu1×

Dong Yu1×

Chen Kang1×

Changliang Li1×

Research Timeline

2026

Stealthy and Adjustable Text-Guided Backdoor Attacks on Multimodal Pretrained Models

The paper proposes a novel Text-Guided Backdoor (TGB) attack that uses common words in text descriptions as stealthy triggers for multimodal models, enhancing practicality and controllability.

ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

The paper introduces ProjLens, an interpretability framework that reveals that backdoor vulnerabilities in Multimodal Large Language Models (MLLMs) are encoded within a low-rank subspace of the projector, causing a measurable semantic shift in poisoned inputs.

DCVD: Dual-Channel Cross-Modal Fusion for Joint Vulnerability Detection and Localization

DCVD proposes a dual-channel cross-modal fusion framework that jointly detects software vulnerabilities and precisely localizes the vulnerable lines, outperforming existing state-of-the-art methods.

Behavioral Integrity Verification for AI Agent Skills

The paper introduces Behavioral Integrity Verification (BIV), a framework that systematically audits AI agent skills by comparing their declared capabilities against their actual implementation, revealing a high rate of behavioral deviation.

Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution

The EvoSafety framework enhances LLM safety by externalizing attack and defense mechanisms, enabling persistent, transferable, and model-agnostic robustness against adversarial prompts.

When the Manual Lies: A Realistic Benchmark to Evaluate MCP Poisoning Attacks for LLM Agents

This paper introduces a new benchmark to test Tool Description Poisoning (TDP) attacks on LLM agents, demonstrating that even advanced models like GPT-4o are highly vulnerable and that current defenses are often ineffective.

Learning Agent-Compatible Context Management for Long-Horizon Tasks

The paper introduces Adaptive Context Management (AdaCoM), an external context manager that uses reinforcement learning to improve the performance of frozen LLM agents on long-horizon tasks by intelligently managing and pruning accumulated context.

Inference Cost Attacks for Retrieval-Augmented Large Language Models

This paper introduces a novel attack, RA-ICA, that targets RAG-enhanced LLMs by poisoning external knowledge bases to drastically increase inference costs, achieving up to a 13.12x increase in token consumption.

THRD: A Training-Free Multi-Turn Defense Framework for Jailbreak Attacks on Large Language Models

THRD introduces a novel, training-free framework that models temporal risk accumulation to effectively defend against multi-turn jailbreak attacks on LLMs, significantly reducing attack success rates while maintaining model utility.

What to Format and How: A Benchmark and Workflow Approach for Document Formatting

The paper introduces DocFormBench, a new benchmark for content-aware document formatting, and proposes DocFormFlow, a workflow that improves formatting accuracy and efficiency by decoupling target localization from modification execution.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIRecentJun 1, 2026

THRD: A Training-Free Multi-Turn Defense Framework for Jailbreak Attacks on Large Language Models

Zhiqing Ma, Zhonghao Xu, Dong Yu, Chen Kang +2 more

View →

cs.CLRecentJun 1, 2026