Dawn Song

11 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×10AI×10ML×5Software Eng.×3NLP×3

Frequent co-authors

Jingxuan He4×

Zhun Wang3×

Chenguang Wang2×

Wenbo Guo2×

Kaihua Qin2×

Arthur Gervais2×

Research Timeline

2026

A Framework for Formalizing LLM Agent Security

The paper introduces a contextual security framework for LLM agents, defining security properties and reformulating various attacks and defenses based on the context of execution.

SecPI: Secure Code Generation with Reasoning Models via Security Reasoning Internalization

The paper introduces SecPI, a fine-tuning pipeline that teaches reasoning language models (RLMs) to autonomously internalize structured security reasoning, significantly improving secure code generation without requiring explicit security prompts at inference.

ExploitGym: Can AI Agents Turn Security Vulnerabilities into Real Attacks?

The paper introduces ExploitGym, a large-scale benchmark, demonstrating that advanced AI agents can successfully turn theoretical software vulnerabilities into working exploits, highlighting growing cybersecurity risks.

Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack

The paper introduces BenchJack, an automated red-teaming system that systematically audits popular AI agent benchmarks, revealing numerous reward-hacking exploits and demonstrating a method to significantly improve benchmark robustness.

Securing LLM Agents Need Intent-to-Execution Integrity

The paper proposes defining 'intent-to-execution integrity' as the necessary end-to-end correctness property for securing LLM agents, arguing that current defenses are insufficient due to untrusted components.

SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers

The paper introduces SCDBench, a comprehensive benchmark dataset and methodology that rigorously evaluates LLM-based smart contract decompilers, finding that while frontier LLMs can generate compilable code, achieving full semantic consistency remains a significant challenge.

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

This study provides the first large-scale measurement of prompt injection attacks in real-world LLM-based resume screening, finding that approximately 1% of resumes contain hidden injections.

SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers

The paper introduces SCDBench, a comprehensive benchmark dataset and methodology that rigorously evaluates LLM-based smart contract decompilers, finding that while frontier models can produce compilable code, achieving full semantic consistency remains a significant challenge.

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

This study provides the first systematic measurement of prompt injection attacks in a real-world LLM-based resume screening application, finding that approximately 1% of resumes contain hidden injections.

BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution

BenchEvolver introduces a solution-centric evolutionary framework to automatically transform saturated coding benchmarks into significantly harder, high-quality, and diverse evaluation suites.

CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

The paper introduces CyberGym-E2E, a large-scale, end-to-end benchmark designed to comprehensively evaluate AI agents' capabilities across the entire lifecycle of real-world software vulnerability discovery, proof-of-concept generation, and patch creation.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.LGRecentJun 3, 2026

CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

Tianneng Shi, Robin Rheem, Dongwei Jiang, Mona Wang +12 more

View →

cs.SEcs.AIcs.CLRecentMay 31, 2026