Yihang Chen
6 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper proposes AnaFP, a theoretically guided analytical fingerprinting scheme that determines the optimal distance of a model's fingerprint from the decision boundary to ensure both robustness and uniqueness for model ownership protection.
IrisFP introduces a novel adversarial-example-based framework that generates composite-sample fingerprints near the intersection of multiple decision boundaries, significantly enhancing model ownership verification robustness and uniqueness.
LiteGuard proposes an efficient task-agnostic model fingerprinting framework that achieves enhanced generalization and significantly reduces computational overhead compared to existing methods like MetaV.
The paper introduces FORCEBENCH, a new stress test designed to evaluate whether cited sources genuinely warrant the strength of a claim, revealing that standard citation evaluation methods often fail to detect over-strong claims.
The paper introduces SkillReact, a framework that measures compositional risk in agent skill ecosystems, finding that even if individual skills are safe, their combination can create significant, exploitable security vulnerabilities.
The paper introduces SkillReact, a framework that measures compositional risk in agent skill ecosystems, finding that even if individual skills are safe, their combination can create significant, unaddressed security vulnerabilities.
Papers
When Safe Skills Collide: Measuring Compositional Risk in Agent Skill Ecosystems
Su Wang, Pin Qian, Yihang Chen, Junxian You +5 more
The paper introduces SkillReact, a framework that measures compositional risk in agent skill ecosystems, finding that even if individual skills are safe, their combination can create significant, expl…