Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Ge Zhang

Ge Zhang

3 indexed papers

Recent (6 mo)
3
With code
0
Influential cites
0
Benchmarked
0

Publications per year

3
26

Top categories

AI×3Crypto×1

Frequent co-authors

Bohan Yang1×
Yijun Gong1×
Zhi Zhang1×
Wenpeng Xing1×
Meng Han1×
Junjie Nian1×

Research Timeline

2026
HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?

This paper presents HarmfulSkillBench, a large-scale benchmark demonstrating that even small percentages of publicly available skills can be misused for harmful actions, significantly lowering LLM refusal rates when integrated into agent workflows.

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

TraceGraph introduces a graph-based framework to map agent decision-making across pooled trajectories, revealing hidden differences in agent behavior and improving performance by targeting known failure regions.

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

TriLens is a white-box detector that monitors the entropy of three internal streams (attention, feed-forward, residual) at every layer of a language model to detect hallucinations by tracking how internal certainty forms.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 31, 2026

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

Bohan Yang, Yijun Gong, Zhi Zhang, Ge Zhang +2 more

TriLens is a white-box detector that monitors the entropy of three internal streams (attention, feed-forward, residual) at every layer of a language model to detect hallucinations by tracking how inte…

View →
cs.AIRecentMay 29, 2026

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

Junjie Nian, Kang Chen, Ge Zhang, Yixin Cao +1 more

TraceGraph introduces a graph-based framework to map agent decision-making across pooled trajectories, revealing hidden differences in agent behavior and improving performance by targeting known failu…

View →
cs.CRcs.AIRecentApr 16, 2026

HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?

Yukun Jiang, Yage Zhang, Michael Backes, Xinyue Shen +1 more

This paper presents HarmfulSkillBench, a large-scale benchmark demonstrating that even small percentages of publicly available skills can be misused for harmful actions, significantly lowering LLM ref…

View →