Arthur Gervais

7 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×7AI×6Software Eng.×2Logic×1

Frequent co-authors

Isaac David5×

Kaihua Qin2×

Dawn Song2×

Marco Guarnieri1×

Research Timeline

2026

Towards Optimal Agentic Architectures for Offensive Security Tasks

The paper empirically evaluates various agentic architectures for offensive security tasks, finding that while broader coordination improves coverage, the optimal architecture is non-monotonic and depends heavily on cost, latency, and exploit difficulty.

Alignment Contracts for Agentic Security Systems

The paper introduces alignment contracts, a formal framework for specifying and enforcing behavioral constraints over observable effect traces, ensuring that powerful agentic security systems operate only within defined scopes.

Patch2Vuln: Agentic Reconstruction of Vulnerabilities from Linux Distribution Binary Patches

The paper introduces Patch2Vuln, a pipeline that uses an LLM agent to reconstruct security vulnerabilities by analyzing differences between old and new Linux binary packages, successfully localizing patches in a majority of tested cases.

Ablating Safety: Mechanisms for Removing Alignment in Language Models for Security Applications

The paper proposes Ablating Safety, a controlled protocol for removing safety alignment from language models, demonstrating that targeted de-alignment can significantly boost security performance while maintaining general capability and controlled unsafe compliance.

Measuring Safety Alignment Effects in Autonomous Security Agents

The study evaluates how safety alignment affects autonomous security agents using a comprehensive trace-based benchmark, finding that while less-restricted models show gains, these effects are not universal and require system-level measurement.

SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers

The paper introduces SCDBench, a comprehensive benchmark dataset and methodology that rigorously evaluates LLM-based smart contract decompilers, finding that while frontier LLMs can generate compilable code, achieving full semantic consistency remains a significant challenge.

SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers

The paper introduces SCDBench, a comprehensive benchmark dataset and methodology that rigorously evaluates LLM-based smart contract decompilers, finding that while frontier models can produce compilable code, achieving full semantic consistency remains a significant challenge.

Highlighted terms show continued research focus across papers

Papers

cs.SEcs.AIcs.CRRecentMay 27, 2026

SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers

Kaihua Qin, Dawn Song, Arthur Gervais

View →

cs.SEcs.AIcs.CRRecentMay 27, 2026