Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Michael H. Conaway

Michael H. Conaway

1 indexed paper

Recent (6 mo)
1
With code
0
Influential cites
0
Benchmarked
0

Publications per year

1
26

Top categories

Crypto×1AI×1NLP×1

Frequent co-authors

Tyler H. Merves1×
Joseph M. Escobar1×
Hakan T. Otal1×
Unal Tatar1×

Research Timeline

2026
Systematic Capability Benchmarking of Frontier Large Language Models for Offensive Cyber Tasks

This study provides a comprehensive benchmark of 10 frontier LLMs on 200 offensive cybersecurity tasks, finding that environment tooling and model selection are the primary performance drivers, with Claude 4.5 Opus achieving the highest solve rate.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.CLRecentApr 18, 2026

Systematic Capability Benchmarking of Frontier Large Language Models for Offensive Cyber Tasks

Tyler H. Merves, Michael H. Conaway, Joseph M. Escobar, Hakan T. Otal +1 more

This study provides a comprehensive benchmark of 10 frontier LLMs on 200 offensive cybersecurity tasks, finding that environment tooling and model selection are the primary performance drivers, with C…

View →