Arlindo Oliveira
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
AI×1NLP×1
Frequent co-authors
Research Timeline
2026
The Importance of Being Statistically Earnest: A Critical Re-evaluation of GSM-Symbolic
The paper challenges the conclusion that LLMs lack reasoning by demonstrating that reported performance drops on GSM-Symbolic are often statistically weak and partially attributable to dataset biases, while also identifying specific failure modes.
Highlighted terms show continued research focus across papers