Han Zhang

17 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×10AI×9NLP×6ML×5Software Eng.×1Comp. Eng.×1Info Retrieval×1Society×1

Frequent co-authors

Xiaohan Zhang4×

Mi Zhang2×

Min Yang2×

Zihan Zhang2×

Mohan Zhang2×

Yuqi Jia2×

Research Timeline

2026

Deanonymizing Bitcoin Transactions via Network Traffic Analysis with Semi-supervised Learning

The paper proposes NTSSL, a novel semi-supervised method that combines network traffic analysis and transaction clustering to significantly improve the deanonymization of Bitcoin transactions.

MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors

The paper introduces MGTEVAL, a comprehensive and extensible platform designed to systematically evaluate the performance, robustness, and efficiency of machine-generated text detectors.

When Alignment Isn't Enough: Response-Path Attacks on LLM Agents

This paper introduces the Relay Tampering Attack (RTA), demonstrating that malicious third-party relays can undermine the security of LLM agents by modifying responses post-alignment, even if the LLM itself is perfectly aligned.

Trusted Weights, Treacherous Optimizations? Optimization-Triggered Backdoor Attacks on LLMs

This paper introduces a novel class of backdoor attacks that exploit the numerical side effects of LLM inference optimization, achieving high success rates while maintaining clean accuracy.

A First Measurement Study on Authentication Security in Real-World Remote MCP Servers

This study provides the first measurement of authentication security in real-world remote Model Context Protocol (MCP) servers, finding pervasive and critical authentication weaknesses, particularly in dynamic client registration.

SolarChain: Bridging Physical Law, Verifiable Trust, and Sustainable Markets for Urban Energy Resilience

SolarChain is a platform that ensures verifiable trust in decentralized solar energy markets by anchoring digital energy credits to the hard physical limits of solar yield, thereby preventing data manipulation and speculative trading.

Steering Beyond the Support: Adversarial Training on Unsupervised Jailbroken Activation Simulation

The paper proposes an unsupervised bi-level adversarial training framework to enhance LLM safety steering, achieving strong zero-shot defense against unseen and evolving jailbreak prompts.

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

This study provides the first large-scale measurement of prompt injection attacks in real-world LLM-based resume screening, finding that approximately 1% of resumes contain hidden injections.

Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor

The paper introduces Thinking as Compression (TaC), a novel paradigm showing that the inherent reasoning process of a large language model can naturally compress long context inputs, outperforming dedicated compression methods.

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

This study provides the first systematic measurement of prompt injection attacks in a real-world LLM-based resume screening application, finding that approximately 1% of resumes contain hidden injections.

Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory

The paper introduces RHELM, a new benchmark designed to test LLMs' long-term memory by simulating realistic, complex, and evolving dialogues that integrate multiple heterogeneous data sources.

What Makes a Strong Model? A Unified Spectral Analysis of Knowledge Transfer over High-dimensional Linear Regression

This paper develops a unified spectral analysis framework to explain how knowledge transfer (KT) works across different machine learning regimes, such as Knowledge Distillation and Weak-to-Strong generalization.

Can LLM Agents Sustain Long-Horizon Organizational Dynamics?

The paper introduces TaskWeave, a hierarchical agentic framework that successfully simulates long-horizon organizational dynamics by treating coordination as a memory-centered problem, demonstrating that structured memory is key to reliable LLM-based simulations.

MsFEM-Inspired CNNs with Transfer Learning for Multiscale Model Reduction

The paper proposes MITL, an MsFEM-inspired transfer learning strategy for CNN-based reduced-order models, enabling efficient and adaptable approximation of multiscale systems with minimal retraining.

CultureForest: Understanding and Evaluating Cultural Norm Grounded Reasoning in LLMs

The paper introduces CultureForest, a new benchmark for evaluating Cultural Norm Grounded Reasoning in LLMs, demonstrating that models struggle to apply their cultural knowledge effectively in realistic, open-ended scenarios.

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards

QUBRIC introduces a co-design framework that simultaneously optimizes queries and rubrics, overcoming the bottleneck of vague rubrics derived from open-ended questions, leading to significant gains in RL performance.

Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

This paper investigates Description-Code Inconsistency (DCI) in Model Context Protocol (MCP) servers, finding that 9.93% of real-world tools exhibit inconsistencies that create security blind spots.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.SERecentJun 3, 2026

Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

Yutao Shi, Xiaohan Zhang, Xiangjing Zhang, Xihua Shen +4 more

This paper investigates Description-Code Inconsistency (DCI) in Model Context Protocol (MCP) servers, finding that 9.93% of real-world tools exhibit inconsistencies that create security blind spots.

View →

cs.CLcs.AIRecentJun 2, 2026