ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

When Safe Skills Collide: Measuring Compositional Risk in Agent Skill Ecosystems

The paper introduces SkillReact, a framework that measures compositional risk in…

Benchmarking Security Risk Detection and Verification in Open Agentic Skill Ecosystems

The paper introduces SkillVetBench, a novel two-stage benchmark that effectively…

SeClaw: Spec-Driven Security Task Synthesis for Evaluating Autonomous Agents

SeClaw is a new framework that synthesizes security tasks from structured risk s…

SkillsInjector: Dynamic Skill Context Construction for LLM Agents

SkillsInjector proposes a two-stage adaptive method to dynamically optimize skil…

Technical Report: Exploring the Emerging Threats of the Agent Skill Ecosystem

The paper analyzes a large sample of AI agent skills, revealing that a significa…

SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training

SIRI introduces a self-internalizing reinforcement learning framework that allow…

SkillRevise: Improving LLM-Authored Agent Skills via Trace-Conditioned Skill Revision

SkillRevise is an execution-grounded framework that iteratively refines initial,…

Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents

The paper introduces MASA, a model-aware skill alignment framework that adaptive…