Tan Zhi-Xuan
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
AI×1NLP×1ML×1
Frequent co-authors
Research Timeline
2026
An Enigma of Artificial Reason: Investigating the Production-Evaluation Gap in Large Reasoning Models
This paper investigates the production-evaluation gap in Large Reasoning Models (LRMs), finding that while LRMs excel at generating solutions, they struggle significantly to evaluate flawed reasoning, often exhibiting an answer confirmation bias.
Highlighted terms show continued research focus across papers