Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Victor Gillioz

Victor Gillioz

1 indexed paper

Recent (6 mo)
1
With code
0
Influential cites
0
Benchmarked
0

Publications per year

1
26

Top categories

NLP×1AI×1ML×1

Frequent co-authors

Aditya Sinha1×
Akshat Naik1×
Simon Storf1×
Kilian Merkelbach1×
Rich Barton-Cooper1×
Axel Højmark1×

Research Timeline

2026
Training Deliberative Monitors for Black-Box Scheming Detection

The paper introduces a novel method for training low-cost, action-only deliberative monitors that detect scheming behavior in autonomous agents, achieving high performance comparable to expensive frontier models.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIcs.LGRecentMay 28, 2026

Training Deliberative Monitors for Black-Box Scheming Detection

Aditya Sinha, Akshat Naik, Victor Gillioz, Simon Storf +4 more

The paper introduces a novel method for training low-cost, action-only deliberative monitors that detect scheming behavior in autonomous agents, achieving high performance comparable to expensive fron…

View →