Xiang Zhang

10 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×6NLP×3Crypto×3Info Retrieval×2ML×2

Frequent co-authors

Yuxiang Zhang2×

HONOR Agentic Search Team1×

Zhengzong Chen1×

Lei Tang1×

Lijun Liu1×

Chuandi Jiang1×

Research Timeline

2026

Spatiotemporal-Aware Bit-Flip Injection on DNN-based Advanced Driver Assistance Systems (extended version)

The paper introduces a Spatiotemporal-Aware Fault Injection (STAFI) framework to efficiently locate and time critical bit-flip vulnerabilities in DNNs used for ADAS, significantly improving fault detection compared to existing methods.

AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents

The paper introduces AgentWard, a lifecycle-oriented, defense-in-depth architecture designed to systematically secure autonomous AI agents by protecting them across all stages of their operation.

When Alignment Isn't Enough: Response-Path Attacks on LLM Agents

This paper introduces the Relay Tampering Attack (RTA), demonstrating that malicious third-party relays can undermine the security of LLM agents by modifying responses post-alignment, even if the LLM itself is perfectly aligned.

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

The paper distinguishes between a model's ability to generate useful updates for external agent components (harness-updating) and its ability to benefit from those updates (harness-benefit), finding that updating capabilities are surprisingly uniform while benefit is maximized in mid-tier models.

Enhancing Multi-Agent Communication through Attention Steering with Context Relevance

The paper introduces Agent-Radar, a training-free method that dynamically steers multi-agent attention toward relevant context using a novel decay mechanism, significantly improving performance in long-running LLM conversations.

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

The paper analyzes observation masking in long-horizon search agents, finding that its effectiveness depends on a complex interaction between the model's capacity and the retriever's strength, exhibiting an inverted-U shaped gain.

WorldCoder-Bench: Benchmarking Physically Grounded 3D World Synthesis

The paper introduces WorldCoder-Bench, a comprehensive benchmark and evaluation protocol for testing LLMs' ability to autonomously generate complex, physically grounded, and interactive 3D web worlds.

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

The paper introduces TVIR, a new benchmark and multi-agent framework for deep research, to evaluate and improve the generation of factually reliable, text-visual interleaved reports.

Understanding Evaluation Illusion in Diffusion Large Language Models

This paper evaluates the consistency and effectiveness of decoding methods for diffusion large language models (dLLMs) across diverse evaluation settings and reveals their sensitivity to prompt templates.

MagicSelector: Joint Optimization for Agent Tool Selection via Counterfactual Decomposition and Progressive Reranking

MagicSelector is a framework for tool retrieval in agents using counterfactual task decomposition, progressive reranking, and dynamic Top-K.

Highlighted terms show continued research focus across papers

Papers

cs.IREmpiricalRecentJul 20, 2026

MagicSelector: Joint Optimization for Agent Tool Selection via Counterfactual Decomposition and Progressive Reranking

HONOR Agentic Search Team, Zhengzong Chen, Lei Tang, Lijun Liu +26 more

MagicSelector is a framework for tool retrieval in agents using counterfactual task decomposition, progressive reranking, and dynamic Top-K.

View →

cs.CLcs.LGEmpirical