The paper presents a novel attack demonstrating that exploiting symmetries can defeat standard auditing mechanisms applied to Introspection Adapters.
Abstract
More Like ThisWe demonstrate an attack on Introspection Adapters (Shenoy et al., 2026).
The paper presents a novel attack demonstrating that exploiting symmetries can defeat standard auditing mechanisms applied to Introspection Adapters.
We demonstrate an attack on Introspection Adapters (Shenoy et al., 2026).
Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Sourc…
This paper introduces a framework to audit source-dependence in multi-source RAG…