Martin Heidebach
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
NLP×1AI×1
Frequent co-authors
Research Timeline
2026
BenGER: Benchmarking LLM Systems on Subsumption-Based Legal Reasoning in German Law
The paper introduces BenGER, a comprehensive benchmark for evaluating LLMs on German legal reasoning, demonstrating that closed-flagship models perform best and that human-AI co-creation significantly improves results over unaided human performance.
Highlighted terms show continued research focus across papers