Ge Zhang

3 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×3Crypto×1

Frequent co-authors

Bohan Yang1×

Yijun Gong1×

Zhi Zhang1×

Wenpeng Xing1×

Meng Han1×

Junjie Nian1×

Research Timeline

2026

HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?

This paper presents HarmfulSkillBench, a large-scale benchmark demonstrating that even small percentages of publicly available skills can be misused for harmful actions, significantly lowering LLM refusal rates when integrated into agent workflows.

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

TraceGraph introduces a graph-based framework to map agent decision-making across pooled trajectories, revealing hidden differences in agent behavior and improving performance by targeting known failure regions.

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

TriLens is a white-box detector that monitors the entropy of three internal streams (attention, feed-forward, residual) at every layer of a language model to detect hallucinations by tracking how internal certainty forms.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 31, 2026

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

Bohan Yang, Yijun Gong, Zhi Zhang, Ge Zhang +2 more

View →

cs.AIRecentMay 29, 2026