Papers similar to 2605.07674v1

~ similar to 2605.07674v1· 20 results

cs.LGcs.CRcs.ITRecentMay 21, 2026

Optimal Guarantees for Auditing Rényi Differentially Private Machine Learning

Benjamin D. Kim, Lav R. Varshney, Daniel Alabi

The paper introduces an optimal black-box auditing framework using Donsker-Varadhan estimators to estimate Rényi differential privacy (RDP) guarantees for machine learning algorithms.

View →

cs.CRRecentMay 14, 2026

Privacy Auditing with Zero (0) Training Run

Tudor Cebere, Mathieu Even, Linus Bleistein, Aurélien Bellet

The paper introduces Zero-Run privacy auditing, a post-hoc framework that allows for practical differential privacy evaluation of large, deployed models without requiring retraining or controlled data…

View →

cs.CRcs.CYcs.LGRecentMay 7, 2026

Gaming the Metric, Not the Harm: Certifying Safety Audits against Strategic Platform Manipulation

Florian A. D. Burnat, Brittany I. Davidson

The paper demonstrates that current safety audit metrics are susceptible to strategic platform manipulation, proposing a more robust 'semantic-envelope' metric that better certifies genuine harm reduc…

View →

cs.CRcs.AIcs.LGRecentApr 20, 2026

Tight Auditing of Differential Privacy in MST and AIM

Georgi Ganev, Meenatchi Sundaram Muthu Selva Annamalai, Bogdan Kulynych

The paper introduces a Gaussian Differential Privacy (GDP)-based auditing framework to provide the first tight audits of privacy guarantees for state-of-the-art synthetic data generators like MST and…

View →

cs.CRcs.AIRecentMay 7, 2026

Narrow Secret Loyalty Dodges Black-Box Audits

Alfie Lamerton, Fabien Roger

The paper introduces and demonstrates 'narrow secret loyalties,' a novel type of covert model manipulation that biases model output toward a specific principal's interests under narrow conditions, whi…

View →

cs.CRcs.ITRecentApr 9, 2026

Realisation-Level Privacy Filtering

Sophie Taylor, Praneeth Vippathalla, Justin Coon

The paper introduces a novel realization-level privacy filtering approach that improves utility in differentially private data release by accounting for actual leakage rather than worst-case per-round…

View →

cs.DScs.CRRecentMay 20, 2026

Near-Optimal Generalized Private Testing

Anamay Chaturvedi, Monika Henzinger, Jalaj Upadhyay

The paper introduces the Generalized Thresholding Mechanism (GTM) to solve the generalized private testing problem in differential privacy, achieving near-optimal accuracy and sample complexity guaran…

View →

cs.CRcs.AIcs.CLRecentMay 28, 2026

Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage

Shahinul Hoque, Jinghuai Zhang, Jinyuan Sun, Fnu Suya

The paper demonstrates that the current per-token billing model for LLMs is susceptible to systematic overcharging because auditing frameworks must rely on evidence provided by the very companies that…

View →

cs.CRcs.AIcs.CLRecentMay 28, 2026

Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage

Shahinul Hoque, Jinghuai Zhang, Jinyuan Sun, Fnu Suya

The paper demonstrates that the current per-token billing model for LLMs is susceptible to systematic inflation because auditing frameworks must rely on evidence provided by the service provider, crea…

View →

cs.CRcs.AIRecentApr 10, 2026

BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning

Guiyao Tie, Jiawen Shi, Pan Zhou, Lichao Sun

The paper introduces BadSkill, a novel backdoor attack formulation that targets third-party agent skills by poisoning the embedded model artifacts, achieving high attack success rates across various m…

View →

cs.AIcs.MARecentMay 29, 2026

Healthcare Mechanisms from Policy-as-Code Search under Strategic Provider Response

Zihan Wang, Xiang Xu, Hongyuan Zha, Wenhao Li

The paper models healthcare mechanism design as program synthesis, demonstrating that an optimized, mixed-objective program can eliminate up-coding and reduce patient rejection while maintaining finan…

View →

cs.CRRecentMay 15, 2026

Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning

Wenhao Wang, Shujie Cui, Hui Cui, Xingliang Yuan

This paper corrects the theoretical analysis of DP-SGD by identifying that common implementations, which use batch averaging, result in weaker privacy guarantees than previously reported.

View →

cs.CRcs.ITRecentMay 4, 2026

Optimal Privacy-Utility Trade-Offs in LDP: Functional and Geometric Perspectives

Seung-Hyun Nam, Hyun-Young Park, Si-Hyeon Lee

The paper develops a unified theoretical framework to systematically characterize the optimal privacy-utility trade-off (PUT) and optimal Local Differential Privacy (LDP) channels for general statisti…

View →

cs.ITcs.CRcs.LGRecentMay 28, 2026

Local Differential Privacy with Correlated Noise Achieves Central-DP Optimal Cost

Madhura Pathegama, Srikanth Avasarala, Viveck R. Cadambe, Juba Ziani

The paper demonstrates that by introducing carefully designed correlations among locally added noise variables, local differential privacy mechanisms can achieve an estimation cost matching the optima…

View →

cs.CRcs.AIRecentMar 18, 2026

Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs

Ya-Ting Yang, Quanyan Zhu

This paper develops a differential privacy framework to analyze and optimize privacy leakage from AI agent responses that utilize sensitive enterprise data, focusing on deriving optimal generation par…

View →

cs.LGcs.AIcs.CRRecentApr 17, 2026

DPrivBench: Benchmarking LLMs' Reasoning for Differential Privacy

Erchi Wang, Pengrun Huang, Eli Chien, Om Thakkar +3 more

The paper introduces DPrivBench, a new benchmark to test whether large language models (LLMs) can automate the complex reasoning required to verify differential privacy guarantees for algorithms.

View →

stat.MLcs.LGRecentJun 2, 2026

Privacy-Robust Incrementality Measurement for Advertising Systems under Signal Loss

Prashant Shekhar, Caroline Howard

The paper proposes a robust causal decision framework to measure advertising incrementality despite multiple sources of privacy-induced signal degradation, providing certified decisions on the strengt…

View →

cs.CRcs.MARecentApr 26, 2026

Breaking the Secret: Economic Interventions for Combating Collusion in Embodied Multi-Agent Systems

Qi Liu, Xiaohui Chen, Zhihui Zhao, Yaowen Zheng +4 more

The paper proposes a mutagenic incentive intervention approach that mitigates collusion in embodied multi-agent systems by reshaping agents' payoff structures, effectively inducing defection and maint…

View →

cs.CRcs.AIRecentMay 29, 2026

PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say

Mingxuan Zhang, Jiahui Han, Dadi Guo, Songze Li +4 more

The paper introduces PrivacyPeek, a new benchmark that audits the acquisition stage of LLM-based agents to demonstrate that unnecessary acquisition of sensitive data is a widespread and critical priva…

View →

cs.CRcs.AIRecentMay 29, 2026

PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say

Mingxuan Zhang, Jiahui Han, Dadi Guo, Songze Li +4 more

The paper introduces PrivacyPeek, a new benchmark that audits the acquisition stage of LLM-based agents to show that unnecessary and sensitive data acquisition is a widespread and critical privacy vul…

View →