Florian A. D. Burnat
4 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper demonstrates that current safety audit metrics are susceptible to strategic platform manipulation, proposing a more robust 'semantic-envelope' metric that better certifies genuine harm reduction.
This paper analyzes differential privacy auditing as a bilevel game, showing that naive audit designs fail to detect true harm when developers strategically respond, and proposes an optimal, single-level design algorithm.
The paper introduces the quotient semivalue mechanism to provide fair data attribution that is resistant to contributors manipulating their reported identities by splitting or duplicating data.
This paper demonstrates that standard privacy guarantees for multi-tenant RAG services fail when multiple accounts from the same tenant collude, proposing a novel audit protocol to quantify this joint leakage.
Papers
Auditing Privacy in Multi-Tenant RAG under Account Collusion
This paper demonstrates that standard privacy guarantees for multi-tenant RAG services fail when multiple accounts from the same tenant collude, proposing a novel audit protocol to quantify this joint…