Chris Hicks
3 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper analyzes UK NIS Regulations data, finding that while 29% of reported incidents are cybersecurity related, the current regulations are limited in scope compared to the volume and nature of significant cyber threats.
This paper synthesizes expert knowledge from a workshop to provide a comprehensive framework and best-practice guidelines for developing high-quality reinforcement learning environments for autonomous cyber defense.
This paper identifies three core weaknesses—benchmark vulnerabilities, temporal staleness, and runtime uncertainty—that undermine current AI agent security evaluations and proposes directions for building more robust testing frameworks.
Papers
Measuring Security Without Fooling Ourselves: Why Benchmarking Agents Is Hard
This paper identifies three core weaknesses—benchmark vulnerabilities, temporal staleness, and runtime uncertainty—that undermine current AI agent security evaluations and proposes directions for buil…