Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Dong Zhang

Dong Zhang

6 indexed papers

Recent (6 mo)
6
With code
0
Influential cites
0
Benchmarked
0

Publications per year

6
26

Top categories

AI×5ML×1NLP×1Social Networks×1Crypto×1

Frequent co-authors

Xudong Zhang3×
Jierui Lei1×
Jiacheng Li1×
Lingdong Shen1×
Jian Cui1×
Haina Tang1×

Research Timeline

2026
Hunting Vulnerability Variants in AI Infra: Measurement and Reference-Driven Detection

This paper measures the prevalence of recurring vulnerability patterns (variants) across multiple AI infrastructure repositories and proposes INFRASCOPE, a framework to automatically detect these variants.

HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs

The paper introduces HRBench, a unified and comprehensive evaluation framework for systematically benchmarking and comparing various thinking-mode switching strategies in hybrid-reasoning LLMs.

CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict

The paper introduces CyberJurors, a multi-agent framework and the VerdictBench benchmark to simulate and solve complex e-commerce dispute verdicts by modeling the reasoning and consensus process of crowdsourced jurors.

Global Policy-Space Response Oracles for Two-Player Zero-Sum Games

The paper introduces Global PSRO, a novel deep reinforcement learning framework that efficiently approximates Nash equilibria in large two-player zero-sum games by intelligently expanding the strategy set using a metric called Population Exploitability.

The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning

The paper proposes a dual-interventional framework to characterize how linguistic structures and contextual cues influence LLMs' spatial reasoning for navigation, finding that topological information is crucial, while semantic details can be unreliable.

VLBM: Variational Latent Basis Modeling for OOD Robust Multivariate Time Series Forecasting

The paper proposes VLBM, a latent basis modeling framework, to achieve state-of-the-art robustness in multivariate time series forecasting, particularly when facing rare but high-impact out-of-distribution (OOD) events.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIRecentJun 1, 2026

VLBM: Variational Latent Basis Modeling for OOD Robust Multivariate Time Series Forecasting

Xudong Zhang, Jierui Lei, Jiacheng Li, Lingdong Shen +2 more

The paper proposes VLBM, a latent basis modeling framework, to achieve state-of-the-art robustness in multivariate time series forecasting, particularly when facing rare but high-impact out-of-distrib…

View →
cs.CLcs.AIRecentMay 29, 2026

The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning

Xudong Zhang, Jian Yang, Shengkai Wang, Jiangpeng Tian +4 more

The paper proposes a dual-interventional framework to characterize how linguistic structures and contextual cues influence LLMs' spatial reasoning for navigation, finding that topological information…

View →
cs.AIRecentMay 27, 2026

HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs

Yansong Ning, Mianpeng Liu, Jingwen Ye, Weidong Zhang +1 more

The paper introduces HRBench, a unified and comprehensive evaluation framework for systematically benchmarking and comparing various thinking-mode switching strategies in hybrid-reasoning LLMs.

View →
cs.AIcs.SIRecentMay 27, 2026

CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict

Yanhui Sun, Wu Liu, Haifeng Ming, Xinru Wang +2 more

The paper introduces CyberJurors, a multi-agent framework and the VerdictBench benchmark to simulate and solve complex e-commerce dispute verdicts by modeling the reasoning and consensus process of cr…

View →
cs.AIRecentMay 27, 2026

Global Policy-Space Response Oracles for Two-Player Zero-Sum Games

Junyu Zhang, Feihong Yang, Jian Wang, Chao Wang +1 more

The paper introduces Global PSRO, a novel deep reinforcement learning framework that efficiently approximates Nash equilibria in large two-player zero-sum games by intelligently expanding the strategy…

View →
cs.CRRecentMay 19, 2026

Hunting Vulnerability Variants in AI Infra: Measurement and Reference-Driven Detection

Tian Dong, Yanjun Chen, Shoufeng Zhang, Huaien Zhang +5 more

This paper measures the prevalence of recurring vulnerability patterns (variants) across multiple AI infrastructure repositories and proposes INFRASCOPE, a framework to automatically detect these vari…

View →