Marcus Sousa

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

ML×1Crypto×1

Frequent co-authors

Ryle Goehausen1×

Research Timeline

2026

Gate AI: LLM Security Benchmark Evaluation Methodology and Results

The paper introduces a robust evaluation methodology, Gate AI, to accurately benchmark LLM security detectors by eliminating systematic weaknesses like per-dataset threshold tuning and undisclosed operating points.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.CRRecentJun 1, 2026

Gate AI: LLM Security Benchmark Evaluation Methodology and Results

Ryle Goehausen, Marcus Sousa

View →