Qiuyu Tian
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year (chart)
Top categories
AI×1
Research Timeline
2026
ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment
The paper introduces ForeSci, a novel benchmark that evaluates LLM agents' ability to make forward-looking research judgments using only historical evidence, finding that explicit evidence organization improves performance but agents often decouple evidence from correct predictions.
Papers
cs.AI · Recent · May 30, 2026
ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment
Qiuyu Tian, Zequn Liu, Yingce Xia, Haojie Yin +1 more