Papers similar to 2605.21780v1

~ similar to 2605.21780v1· 20 results

cs.CRcs.LGRecentApr 22, 2026

Towards Certified Malware Detection: Provable Guarantees Against Evasion Attacks

Nandakrishna Giri, Asmitha K. A., Serena Nicolazzo, Antonino Nocera +1 more

The paper proposes a certifiably robust malware detection framework using randomized smoothing and feature ablation to guarantee detection accuracy against metamorphic evasion attacks.

View →

cs.LGcs.CRmath.OCRecentMar 24, 2026

Byzantine-Robust and Differentially Private Federated Optimization under Weaker Assumptions

Rustem Islamov, Grigory Malinovsky, Alexander Gaponov, Aurelien Lucchi +2 more

The paper proposes Byz-Clip21-SGD2M, a novel algorithm that achieves high-probability convergence guarantees for Federated Learning by integrating robust aggregation, double momentum, and clipping, re…

View →

cs.CRcs.AIRecentApr 27, 2026

Defusing the Trigger: Plug-and-Play Defense for Backdoored LLMs via Tail-Risk Intrinsic Geometric Smoothing

Kaisheng Fan, Weizhe Zhang, Yishu Gao, Tegawendé F. Bissyandé +1 more

The paper introduces Tail-risk Intrinsic Geometric Smoothing (TIGS), a plug-and-play, inference-time defense that suppresses backdoor attacks on LLMs by structurally smoothing the attention mechanism…

View →

cs.CRRecentMay 15, 2026

Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning

Wenhao Wang, Shujie Cui, Hui Cui, Xingliang Yuan

This paper corrects the theoretical analysis of DP-SGD by identifying that common implementations, which use batch averaging, result in weaker privacy guarantees than previously reported.

View →

cs.LGcs.CRstat.MLTheoreticalRecentJul 10, 2026

Statistically Undetectable Backdoors in Deep Neural Networks

Andrej Bogdanov, Alon Rosen, Neekon Vafa

An adversarial model trainer can plant statistically undetectable backdoors in deep neural networks, providing access to adversarial examples.

View →

cs.LGcs.CRRecentMay 27, 2026

Density-aware Sample-specific Attack

Qiyuan Wang, Yao Li, Raymond K. W. Wong

This paper proposes a density-aware attack that constructs triggers by placing poisoned samples in low-density regions of the clean data distribution, achieving high attack success rates even after st…

View →

cs.CRcs.AIRecentMay 11, 2026

Guaranteed Jailbreaking Defense via Disrupt-and-Rectify Smoothing

Zheng Lin, Zhenxing Niu, Haoxuan Ji, Haichang Gao

The paper introduces Disrupt-and-Rectify Smoothing (DR-Smoothing), a novel two-stage defense mechanism that significantly improves LLM security against jailbreaking attacks by restoring disrupted inpu…

View →

cs.LGcs.CRRecentMay 19, 2026

An exponential mechanism based on quadratic approximations for fine-tuning machine learning models with privacy guarantees

Hoang Tran, Jorge Ramirez, Jiayi Wang, Alberto Bocchinfuso +2 more

The paper proposes a novel exponential mechanism using quadratic approximations to fine-tune machine learning models on sensitive data while providing strong differential privacy guarantees.

View →

cs.CRcs.AIcs.LGRecentMay 27, 2026

Mind the Gap: Mixtures of Gaussians in Approximate Differential Privacy

Huikang Liu, Aras Selvi, Wolfram Wiesemann

The paper introduces 'mixture mechanisms,' a novel class of additive noise mechanisms that achieve approximate differential privacy by mixing multiple Gaussian distributions, resulting in lower noise…

View →

cs.CRcs.AIcs.LGRecentMay 27, 2026

Mind the Gap: Mixtures of Gaussians in Approximate Differential Privacy

Huikang Liu, Aras Selvi, Wolfram Wiesemann

The paper introduces 'mixture mechanisms,' a novel class of additive noise mechanisms that achieve differential privacy for real-valued queries, significantly reducing noise compared to the standard G…

View →

cs.CRRecentMay 26, 2026

Beyond Epsilon: A Principled QIF Framework for Local Differential Privacy

Ramon G. Gonze, Natasha Fernandes, Heber H. Arcolezi, Catuscia Palamidessi +1 more

The paper proposes a Quantitative Information Flow (QIF) framework to systematically and rigorously compare Local Differential Privacy (LDP) frequency estimation protocols, moving beyond simple $\vare…

View →

cs.ITcs.CRmath.STRecentMar 21, 2026

Composition Theorems for Multiple Differential Privacy Constraints

Cemre Cadir, Salim Najib, Yanina Y. Shkel

The paper develops a general framework to exactly characterize the composition of mechanisms satisfying multiple differential privacy constraints, extending known results to arbitrary numbers of const…

View →

cs.CRcs.LGRecentMar 19, 2026

Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference

Pranay Anchuri, Matteo Campanelli, Paul Cesaretti, Rosario Gennaro +3 more

The paper introduces a lightweight, sampling-based cryptographic protocol for verifiable AI inference that drastically reduces proving overhead from minutes to milliseconds by leveraging statistical p…

View →

cs.LGcs.CRcs.ITRecentMay 21, 2026

Optimal Guarantees for Auditing Rényi Differentially Private Machine Learning

Benjamin D. Kim, Lav R. Varshney, Daniel Alabi

The paper introduces an optimal black-box auditing framework using Donsker-Varadhan estimators to estimate Rényi differential privacy (RDP) guarantees for machine learning algorithms.

View →

cs.CRRecentMay 14, 2026

Privacy Auditing with Zero (0) Training Run

Tudor Cebere, Mathieu Even, Linus Bleistein, Aurélien Bellet

The paper introduces Zero-Run privacy auditing, a post-hoc framework that allows for practical differential privacy evaluation of large, deployed models without requiring retraining or controlled data…

View →

cs.CRcs.ITRecentMay 20, 2026

Information Leakage Envelopes

Sara Saeidian, Carlos Pinzón, Catuscia Palamidessi

The paper introduces the PML envelope, a novel definition that provides a robust and operationally meaningful measure of information leakage about a secret, satisfying both post-processing robustness…

View →

cs.CRcs.AIcs.LGRecentApr 9, 2026

TADP-RME: A Trust-Adaptive Differential Privacy Framework for Enhancing Reliability of Data-Driven Systems

Labani Halder, Payel Sadhukhan, Sarbani Palit

TADP-RME introduces a trust-adaptive differential privacy framework that enhances data system reliability by dynamically adjusting the privacy budget based on user trust and disrupting geometric struc…

View →

cs.LGcs.AIcs.CRRecentApr 22, 2026

Differentially Private Model Merging

Qichuan Yin, Manzil Zaheer, Tian Li

This paper proposes two post-processing techniques, random selection and linear combination, to construct a model that satisfies any desired differential privacy level without retraining, given a set…

View →

cs.LGcs.CRRecentMay 25, 2026

On Reliability of Efficient Membership Inference Vulnerability Evaluation

Joonas Jälkö, Gauri Pradhan, Ossi Räisä, Antti Honkela

This paper analyzes the reliability of efficient membership inference attack (MIA) evaluation methods, demonstrating that standard aggregation techniques introduce biases that compromise accurate vuln…

View →

cs.LGcs.AIRecentMay 31, 2026

CEAR: Certified Ensemble Adversarial Robustness in DNNs

Daniel Sadig, Mohammadreza Maleki, Hamed Karimi, Reza Samavi

The paper proposes CEAR, an ensemble-based method that combines empirical and certified defenses to achieve superior provable robustness against adversarial attacks in Deep Neural Networks.

View →