Papers similar to 2605.18670v1

Similar to 2605.18670v1 · 20 results

cs.CR · Recent · Apr 20, 2026

Audit-or-Cast: Enforcing Honest Elections with Privacy-Preserving Public Verification

Aman Rojjha, Gaurang Tandon, Varul Srivastava, Kannan Srinathan

The paper introduces ACE, a novel voting protocol that achieves end-to-end verifiability and strong voter privacy by combining tally-hiding aggregation with an Audit-or-Cast challenge, eliminating the…


cs.CR · Recent · Mar 23, 2026

Publicly Understandable Electronic Voting: A Non-Cryptographic, End-to-End Verifiable Scheme

Alon Gat

The paper proposes a non-cryptographic, End-to-End Verifiable (E2E-V) voting scheme that achieves Software-Free Verification (SFV) by allowing voters to audit election integrity using only basic arith…


cs.CR, cs.AI, cs.CY · Recent · Apr 28, 2026

Making AI-Assisted Grant Evaluation Auditable without Exposing the Model

Kemal Bicakci

The paper proposes a TEE-based architecture that enables external, auditable verification of AI-assisted grant evaluations without exposing the proprietary model, scoring logic, or intermediate reason…


cs.CR · Recent · Mar 31, 2026

On the Necessity of Pre-agreed Secrets for Thwarting Last-minute Coercion: Vulnerabilities and Lessons From the Loki E-voting Protocol

Jingxin Qiao, Myrto Arapinis, Thomas Zacharias

This paper analyzes the Loki e-voting protocol, demonstrating that while it attempts to solve coercion-resistance without pre-agreed secrets, it remains vulnerable to specific attacks, suggesting that…


cs.CR · Recent · May 6, 2026

Sealing the Audit-Runtime Gap for LLM Skills

Tingda Shen, Yebo Feng, Konglin Zhu, Xiaojun Jia +2 more

The paper introduces SIGIL, a novel framework that cryptographically seals the entire lifecycle of LLM skills, ensuring verifiable integrity from publication through runtime execution to prevent suppl…


cs.CR, cs.AI, cs.CL · Recent · Jun 2, 2026

Decoupled Smart Contract Audits: Lightweight LLM Framework via Distillation and Aggregation

Bagus Rakadyanto Oktavianto Putra, Muhamad Risqi Utama Saputra, Widyawan, Guntur Dharma Putra

The paper introduces an efficient, lightweight LLM framework for smart contract auditing that decouples the audit process into multiple components, achieving high accuracy while significantly reducing…


cs.LG, cs.CR · Recent · May 25, 2026

On Reliability of Efficient Membership Inference Vulnerability Evaluation

Joonas Jälkö, Gauri Pradhan, Ossi Räisä, Antti Honkela

This paper analyzes the reliability of efficient membership inference attack (MIA) evaluation methods, demonstrating that standard aggregation techniques introduce biases that compromise accurate vuln…


cs.CR, cs.AI · Recent · May 7, 2026

Narrow Secret Loyalty Dodges Black-Box Audits

Alfie Lamerton, Fabien Roger

The paper introduces and demonstrates 'narrow secret loyalties,' a novel type of covert model manipulation that biases model output toward a specific principal's interests under narrow conditions, whi…


cs.CR, cs.CY, cs.LG · Recent · May 7, 2026

Gaming the Metric, Not the Harm: Certifying Safety Audits against Strategic Platform Manipulation

Florian A. D. Burnat, Brittany I. Davidson

The paper demonstrates that current safety audit metrics are susceptible to strategic platform manipulation, proposing a more robust 'semantic-envelope' metric that better certifies genuine harm reduc…


cs.AI · Recent · May 29, 2026

PReMISE: Policy Rubrics as Measurement Specifications for LLM Judges

Swastik Roy, Rajkumar Pujari, Tharindu Kumarage, Charith Peris +4 more

PReMISE introduces a framework to audit and improve the quality of rubrics used to guide LLM judges, demonstrating that it can significantly increase judge accuracy and reduce the exploitability of re…


cs.CR, cs.AI · Recent · May 25, 2026

Referential Security as a New Paradigm for AI Evaluations

Dan Ristea, Vasilios Mavroudis

The paper proposes referential security as a new paradigm for AI evaluation to ensure that safety claims and audits remain tied to specific, verifiable system instances despite continuous, unannounced…


cs.CR, cs.LG · Recent · Apr 22, 2026

Breaking Bad: Interpretability-Based Safety Audits of State-of-the-Art LLMs

Krishiv Agarwal, Ramneet Kaur, Colin Samplawski, Manoj Acharya +5 more

The paper conducts an interpretability-driven safety audit of eight state-of-the-art LLMs, demonstrating that while interpretability-based steering is a powerful auditing tool, model robustness varies…


cs.CR, cs.AI, cs.CL · Recent · May 28, 2026

Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage

Shahinul Hoque, Jinghuai Zhang, Jinyuan Sun, Fnu Suya

The paper demonstrates that the current per-token billing model for LLMs is susceptible to systematic overcharging because auditing frameworks must rely on evidence provided by the very companies that…


cs.CR, cs.AI, cs.CL · Recent · May 28, 2026

Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage

Shahinul Hoque, Jinghuai Zhang, Jinyuan Sun, Fnu Suya

The paper demonstrates that the current per-token billing model for LLMs is susceptible to systematic inflation because auditing frameworks must rely on evidence provided by the service provider, crea…


cs.CR, cs.CL · Recent · Apr 24, 2026

Behavioral Canaries: Auditing Private Retrieved Context Usage in RL Fine-Tuning

Chaoran Chen, Dayu Yuan, Peter Kairouz

The paper introduces Behavioral Canaries, a novel auditing mechanism that detects unauthorized use of private retrieved context data during Reinforcement Learning Fine-Tuning (RLFT) by inducing detect…


cs.LG, cs.CR, cs.IT · Recent · May 21, 2026

Optimal Guarantees for Auditing Rényi Differentially Private Machine Learning

Benjamin D. Kim, Lav R. Varshney, Daniel Alabi

The paper introduces an optimal black-box auditing framework using Donsker-Varadhan estimators to estimate Rényi differential privacy (RDP) guarantees for machine learning algorithms.


cs.DS, cs.CR · Recent · May 20, 2026

Near-Optimal Generalized Private Testing

Anamay Chaturvedi, Monika Henzinger, Jalaj Upadhyay

The paper introduces the Generalized Thresholding Mechanism (GTM) to solve the generalized private testing problem in differential privacy, achieving near-optimal accuracy and sample complexity guaran…


cs.CR, cs.AI · Recent · Apr 28, 2026

Structured Security Auditing and Robustness Enhancement for Untrusted Agent Skills

Lijia Lv, Xuehai Tang, Jie Wen, Jizhong Han +1 more

The paper introduces SkillGuard-Robust, a novel framework for robust, cross-file security auditing of untrusted agent skills, achieving high accuracy on large-scale package evaluations.


cs.CR, cs.AI · Recent · Mar 31, 2026

Security in LLM-as-a-Judge: A Comprehensive SoK

Aiman Al Masoud, Antony Anju, Marco Arazzi, Mert Cihangiroglu +5 more

This paper provides the first comprehensive Systematization of Knowledge (SoK) on the security aspects of LLM-as-a-Judge (LaaJ) systems, identifying key vulnerabilities and proposing a taxonomy for fu…


cs.CR, cs.CY, eess.SP · Recent · May 24, 2026

Pre-Characterization of Electromagnetic Side-Channel Leakage Using Publicly Available Information: A Case Study on E-Voting Interfaces

Leonardo Teodoro, Kemuel L. Vieira, Saulo Queiroz

The paper demonstrates that the Brazilian e-Voting Machine interface generates a simple and highly distinctive electromagnetic spectral signature, raising significant concerns about its susceptibility…
