Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Xu Li

Xu Li

8 indexed papers

Recent (6 mo)
8
With code
0
Influential cites
0
Benchmarked
0

Publications per year

8
26

Top categories

AI×5Crypto×3NLP×2Multimedia×1Sound×1

Frequent co-authors

Xinyi Li3×
Xun Han2×
Chengzhengxu Li2×
Zhaohan Zhang2×
Zedong Fu1×
Hanzhe Tu1×

Research Timeline

2026
From Context to Rules: Toward Unified Detection Rule Generation

The paper proposes UniRule, a novel agentic RAG framework that unifies the detection rule generation process by mapping context and language to rules, significantly outperforming pure LLM generation.

MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors

The paper introduces MGTEVAL, a comprehensive and extensible platform designed to systematically evaluate the performance, robustness, and efficiency of machine-generated text detectors.

FragBench: Cross-Session Attacks Hidden in Benign-Looking Fragments

The paper introduces FragBench, a novel benchmark designed to detect malicious LLM attacks that are split across multiple, seemingly benign sessions, showing that cross-session graph modeling is necessary for effective defense.

Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor

The paper introduces Thinking as Compression (TaC), a novel paradigm showing that the inherent reasoning process of a large language model can naturally compress long context inputs, outperforming dedicated compression methods.

Bridging the Detection-to-Abstention Gap in Reasoning Models under Insufficient Information

The paper addresses the 'detection-to-abstention gap' in reasoning models, where detecting insufficient information does not lead to abstention, by proposing a novel control framework that forces models to commit to an answerability judgment before solving.

MTAVG-Bench 2.0: Diagnosing Failure Modes of Cinematic Expressiveness in Multi-Talker Audio-Video Generation

The paper introduces MTAVG-Bench 2.0, a new benchmark designed to diagnose high-level failure modes of cinematic expressiveness in multi-talker audio-video generation, showing that even advanced models struggle with complex scene-level failures.

EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation

EvoGens is an evolution-inspired framework that treats scientific idea generation as an evolutionary search, significantly boosting the novelty and diversity of generated research ideas compared to existing LLM-based methods.

TrafficRAG: A Multimodal RAG Framework for Traffic Accident Liability Determination

TrafficRAG is a multimodal retrieval-augmented framework that automates traffic accident liability determination by integrating visual evidence, structured legal knowledge, and advanced LLM reasoning.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentJun 1, 2026

TrafficRAG: A Multimodal RAG Framework for Traffic Accident Liability Determination

Xu Li, Zedong Fu, Xinyi Li, Xun Han

TrafficRAG is a multimodal retrieval-augmented framework that automates traffic accident liability determination by integrating visual evidence, structured legal knowledge, and advanced LLM reasoning.

View →
cs.CLRecentMay 29, 2026

EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation

Xu Li, Hanzhe Tu, Xinyi Li, Kuncheng Zhao +2 more

EvoGens is an evolution-inspired framework that treats scientific idea generation as an evolutionary search, significantly boosting the novelty and diversity of generated research ideas compared to ex…

View →
cs.AIRecentMay 27, 2026

Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor

Guoxin Ma, Yibing Liu, Chengzhengxu Li, Yu Liang +6 more

The paper introduces Thinking as Compression (TaC), a novel paradigm showing that the inherent reasoning process of a large language model can naturally compress long context inputs, outperforming ded…

View →
cs.AIRecentMay 27, 2026

Bridging the Detection-to-Abstention Gap in Reasoning Models under Insufficient Information

Renjie Gu, Jiaxu Li, Yihao Wang, Yun Yue +7 more

The paper addresses the 'detection-to-abstention gap' in reasoning models, where detecting insufficient information does not lead to abstention, by proposing a novel control framework that forces mode…

View →
cs.AIcs.MMcs.SDRecentMay 27, 2026

MTAVG-Bench 2.0: Diagnosing Failure Modes of Cinematic Expressiveness in Multi-Talker Audio-Video Generation

Haitian Li, Yanghao Zhou, Heyan Huang, Liangji Chen +14 more

The paper introduces MTAVG-Bench 2.0, a new benchmark designed to diagnose high-level failure modes of cinematic expressiveness in multi-talker audio-video generation, showing that even advanced model…

View →
cs.CRcs.AIRecentMay 10, 2026

FragBench: Cross-Session Attacks Hidden in Benign-Looking Fragments

Astha Mehta, Niruthiha Selvanayagam, Cedric Lam, Hengxu Li +9 more

The paper introduces FragBench, a novel benchmark designed to detect malicious LLM attacks that are split across multiple, seemingly benign sessions, showing that cross-session graph modeling is neces…

View →
cs.CRcs.CLRecentApr 28, 2026

MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors

Yuanfan Li, Qi Zhou, Chengzhengxu Li, Zhaohan Zhang +4 more

The paper introduces MGTEVAL, a comprehensive and extensible platform designed to systematically evaluate the performance, robustness, and efficiency of machine-generated text detectors.

View →
cs.CRRecentApr 13, 2026

From Context to Rules: Toward Unified Detection Rule Generation

Cheng Meng, Wenxin Le, Xinyi Li, Qiuyun Wang +3 more

The paper proposes UniRule, a novel agentic RAG framework that unifies the detection rule generation process by mapping context and language to rules, significantly outperforming pure LLM generation.

View →