AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning

The paper introduces BadSkill, a novel backdoor attack formulation that targets…

SkillAttack: Automated Red Teaming of Agent Skills through Attack Path Refinement

SkillAttack is a red-teaming framework that dynamically tests the exploitability…

Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems

The paper introduces Document-Driven Implicit Payload Execution (DDIPE) to demon…

Context Matters: Repository-Aware Security Analysis of the Agent Skill Ecosystem

This paper conducts a large-scale, repository-aware security analysis of AI agen…

Towards Secure Agent Skills: Architecture, Threat Taxonomy, and Security Analysis

This paper provides the first comprehensive security analysis of the Agent Skill…

"Elementary, My Dear Watson." Detecting Malicious Skills via Neuro-Symbolic Reasoning across Heterog…

The paper introduces MalSkills, a neuro-symbolic framework that detects maliciou…

SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems

SkillTrojan introduces a novel backdoor attack targeting the composition of reus…

SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration

The paper proposes SkillProbe, a multi-agent security auditing framework, demons…