Tan Zhi-Xuan

1 indexed paper


Top categories

AI ×1, NLP ×1, ML ×1

Frequent co-authors

Mingzhong Sun (1×)

Teresa Yeo (1×)

Armando Solar-Lezama (1×)

Research Timeline

2026

An Enigma of Artificial Reason: Investigating the Production-Evaluation Gap in Large Reasoning Models

This paper investigates the production-evaluation gap in Large Reasoning Models (LRMs), finding that while LRMs excel at generating solutions, they struggle significantly to evaluate flawed reasoning, often exhibiting an answer confirmation bias.


Papers

cs.AI, cs.CL, cs.LG | Recent | May 31, 2026

An Enigma of Artificial Reason: Investigating the Production-Evaluation Gap in Large Reasoning Models

Mingzhong Sun, Teresa Yeo, Armando Solar-Lezama, Tan Zhi-Xuan
