Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Han Zhang

Han Zhang

17 indexed papers

Recent (6 mo)
17
With code
0
Influential cites
0
Benchmarked
0

Publications per year

17
26

Top categories

Crypto×10AI×9NLP×6ML×5Software Eng.×1Comp. Eng.×1Info Retrieval×1Society×1

Frequent co-authors

Xiaohan Zhang4×
Mi Zhang2×
Min Yang2×
Zihan Zhang2×
Mohan Zhang2×
Yuqi Jia2×

Research Timeline

2026
Deanonymizing Bitcoin Transactions via Network Traffic Analysis with Semi-supervised Learning

The paper proposes NTSSL, a novel semi-supervised method that combines network traffic analysis and transaction clustering to significantly improve the deanonymization of Bitcoin transactions.

MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors

The paper introduces MGTEVAL, a comprehensive and extensible platform designed to systematically evaluate the performance, robustness, and efficiency of machine-generated text detectors.

When Alignment Isn't Enough: Response-Path Attacks on LLM Agents

This paper introduces the Relay Tampering Attack (RTA), demonstrating that malicious third-party relays can undermine the security of LLM agents by modifying responses post-alignment, even if the LLM itself is perfectly aligned.

Trusted Weights, Treacherous Optimizations? Optimization-Triggered Backdoor Attacks on LLMs

This paper introduces a novel class of backdoor attacks that exploit the numerical side effects of LLM inference optimization, achieving high success rates while maintaining clean accuracy.

A First Measurement Study on Authentication Security in Real-World Remote MCP Servers

This study provides the first measurement of authentication security in real-world remote Model Context Protocol (MCP) servers, finding pervasive and critical authentication weaknesses, particularly in dynamic client registration.

SolarChain: Bridging Physical Law, Verifiable Trust, and Sustainable Markets for Urban Energy Resilience

SolarChain is a platform that ensures verifiable trust in decentralized solar energy markets by anchoring digital energy credits to the hard physical limits of solar yield, thereby preventing data manipulation and speculative trading.

Steering Beyond the Support: Adversarial Training on Unsupervised Jailbroken Activation Simulation

The paper proposes an unsupervised bi-level adversarial training framework to enhance LLM safety steering, achieving strong zero-shot defense against unseen and evolving jailbreak prompts.

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

This study provides the first large-scale measurement of prompt injection attacks in real-world LLM-based resume screening, finding that approximately 1% of resumes contain hidden injections.

Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor

The paper introduces Thinking as Compression (TaC), a novel paradigm showing that the inherent reasoning process of a large language model can naturally compress long context inputs, outperforming dedicated compression methods.

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

This study provides the first systematic measurement of prompt injection attacks in a real-world LLM-based resume screening application, finding that approximately 1% of resumes contain hidden injections.

Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory

The paper introduces RHELM, a new benchmark designed to test LLMs' long-term memory by simulating realistic, complex, and evolving dialogues that integrate multiple heterogeneous data sources.

What Makes a Strong Model? A Unified Spectral Analysis of Knowledge Transfer over High-dimensional Linear Regression

This paper develops a unified spectral analysis framework to explain how knowledge transfer (KT) works across different machine learning regimes, such as Knowledge Distillation and Weak-to-Strong generalization.

Can LLM Agents Sustain Long-Horizon Organizational Dynamics?

The paper introduces TaskWeave, a hierarchical agentic framework that successfully simulates long-horizon organizational dynamics by treating coordination as a memory-centered problem, demonstrating that structured memory is key to reliable LLM-based simulations.

MsFEM-Inspired CNNs with Transfer Learning for Multiscale Model Reduction

The paper proposes MITL, an MsFEM-inspired transfer learning strategy for CNN-based reduced-order models, enabling efficient and adaptable approximation of multiscale systems with minimal retraining.

CultureForest: Understanding and Evaluating Cultural Norm Grounded Reasoning in LLMs

The paper introduces CultureForest, a new benchmark for evaluating Cultural Norm Grounded Reasoning in LLMs, demonstrating that models struggle to apply their cultural knowledge effectively in realistic, open-ended scenarios.

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards

QUBRIC introduces a co-design framework that simultaneously optimizes queries and rubrics, overcoming the bottleneck of vague rubrics derived from open-ended questions, leading to significant gains in RL performance.

Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

This paper investigates Description-Code Inconsistency (DCI) in Model Context Protocol (MCP) servers, finding that 9.93% of real-world tools exhibit inconsistencies that create security blind spots.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.SERecentJun 3, 2026

Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

Yutao Shi, Xiaohan Zhang, Xiangjing Zhang, Xihua Shen +4 more

This paper investigates Description-Code Inconsistency (DCI) in Model Context Protocol (MCP) servers, finding that 9.93% of real-world tools exhibit inconsistencies that create security blind spots.

View →
cs.CLcs.AIRecentJun 2, 2026

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards

Rongzhi Zhang, Rui Feng, Zhihan Zhang, Jingfeng Yang +7 more

QUBRIC introduces a co-design framework that simultaneously optimizes queries and rubrics, overcoming the bottleneck of vague rubrics derived from open-ended questions, leading to significant gains in…

View →
cs.CLRecentJun 1, 2026

CultureForest: Understanding and Evaluating Cultural Norm Grounded Reasoning in LLMs

Yangfan Ye, Xiaocheng Feng, Jialong Tang, Xiayu Cao +4 more

The paper introduces CultureForest, a new benchmark for evaluating Cultural Norm Grounded Reasoning in LLMs, demonstrating that models struggle to apply their cultural knowledge effectively in realist…

View →
cs.LGcs.AIRecentMay 31, 2026

What Makes a Strong Model? A Unified Spectral Analysis of Knowledge Transfer over High-dimensional Linear Regression

Wendao Wu, Fangqing Zhang, Haihan Zhang, Cong Fang

This paper develops a unified spectral analysis framework to explain how knowledge transfer (KT) works across different machine learning regimes, such as Knowledge Distillation and Weak-to-Strong gene…

View →
cs.AIRecentMay 31, 2026

Can LLM Agents Sustain Long-Horizon Organizational Dynamics?

Xuancheng Zhu, Yang Yue, Shuaibing Wan, Zihan Dou +3 more

The paper introduces TaskWeave, a hierarchical agentic framework that successfully simulates long-horizon organizational dynamics by treating coordination as a memory-centered problem, demonstrating t…

View →
cs.CERecentMay 31, 2026

MsFEM-Inspired CNNs with Transfer Learning for Multiscale Model Reduction

Xuehan Zhang, Lijian Jiang, Eric T. Chung

The paper proposes MITL, an MsFEM-inspired transfer learning strategy for CNN-based reduced-order models, enabling efficient and adaptable approximation of multiscale systems with minimal retraining.

View →
cs.CLcs.IRRecentMay 29, 2026

Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory

Han Zhang, Zihao Tang, Xin Yu, Xiao Liu +7 more

The paper introduces RHELM, a new benchmark designed to test LLMs' long-term memory by simulating realistic, complex, and evolving dialogues that integrate multiple heterogeneous data sources.

View →
cs.CRcs.AIcs.CLRecentMay 27, 2026

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

Mohan Zhang, Yuqi Jia, Zhen Tan, Steven Jiang +3 more

This study provides the first large-scale measurement of prompt injection attacks in real-world LLM-based resume screening, finding that approximately 1% of resumes contain hidden injections.

View →
cs.AIRecentMay 27, 2026

Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor

Guoxin Ma, Yibing Liu, Chengzhengxu Li, Yu Liang +6 more

The paper introduces Thinking as Compression (TaC), a novel paradigm showing that the inherent reasoning process of a large language model can naturally compress long context inputs, outperforming ded…

View →
cs.CRcs.AIcs.CLRecentMay 27, 2026

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening

Mohan Zhang, Yuqi Jia, Zhen Tan, Steven Jiang +3 more

This study provides the first systematic measurement of prompt injection attacks in a real-world LLM-based resume screening application, finding that approximately 1% of resumes contain hidden injecti…

View →
cs.CRcs.LGRecentMay 23, 2026

Steering Beyond the Support: Adversarial Training on Unsupervised Jailbroken Activation Simulation

Luoyu Chen, Weiqi Wang, Zhiyi Tian, Chenhan Zhang +4 more

The paper proposes an unsupervised bi-level adversarial training framework to enhance LLM safety steering, achieving strong zero-shot defense against unseen and evolving jailbreak prompts.

View →
cs.CYcs.CRcs.DCRecentMay 22, 2026

SolarChain: Bridging Physical Law, Verifiable Trust, and Sustainable Markets for Urban Energy Resilience

Shilin Ou, Yifan Xu, Zhenshan Zhang, Luyao Zhang +1 more

SolarChain is a platform that ensures verifiable trust in decentralized solar energy markets by anchoring digital energy credits to the hard physical limits of solar yield, thereby preventing data man…

View →
cs.CRRecentMay 21, 2026

A First Measurement Study on Authentication Security in Real-World Remote MCP Servers

Huijun Zhou, Xiaohan Zhang, Haozhe Zhang, Haoyang Zhang +2 more

This study provides the first measurement of authentication security in real-world remote Model Context Protocol (MCP) servers, finding pervasive and critical authentication weaknesses, particularly i…

View →
cs.CRcs.AIcs.LGRecentMay 20, 2026

Trusted Weights, Treacherous Optimizations? Optimization-Triggered Backdoor Attacks on LLMs

Yifei Wang, Tianlin Li, Xiaohan Zhang, Yida Yang +2 more

This paper introduces a novel class of backdoor attacks that exploit the numerical side effects of LLM inference optimization, achieving high success rates while maintaining clean accuracy.

View →
cs.CRcs.AIRecentMay 4, 2026

When Alignment Isn't Enough: Response-Path Attacks on LLM Agents

Mingyu Luo, Zihan Zhang, Zesen Liu, Yuchong Xie +6 more

This paper introduces the Relay Tampering Attack (RTA), demonstrating that malicious third-party relays can undermine the security of LLM agents by modifying responses post-alignment, even if the LLM…

View →
cs.CRcs.CLRecentApr 28, 2026

MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors

Yuanfan Li, Qi Zhou, Chengzhengxu Li, Zhaohan Zhang +4 more

The paper introduces MGTEVAL, a comprehensive and extensible platform designed to systematically evaluate the performance, robustness, and efficiency of machine-generated text detectors.

View →
cs.CRRecentMar 18, 2026

Deanonymizing Bitcoin Transactions via Network Traffic Analysis with Semi-supervised Learning

Shihan Zhang, Bing Han, Chuanyong Tian, Ruisheng Shi +2 more

The paper proposes NTSSL, a novel semi-supervised method that combines network traffic analysis and transaction clustering to significantly improve the deanonymization of Bitcoin transactions.

View →