Hannah Rose Kirk
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
NLP×1
Frequent co-authors
Research Timeline
2026
RealityTest: How People Probe AI Identity and Whether Models Disclose It
RealityTest introduces a large-scale, multimodal, and multilingual benchmark using real-world human data to test how AI systems disclose their identity, finding that context and phrasing are more critical than the model itself.
Highlighted terms show continued research focus across papers