Built by Teycir Ben Soltane
ArXivCSExplorer

Zongjie Li

2 indexed papers

Recent (6 mo): 2
With code: 0
Influential cites: 0
Benchmarked: 0

Publications per year

2026: 2

Top categories

AI ×2, Crypto ×2, Software Eng. ×1

Frequent co-authors

Shuai Wang (2×)
Zimo Ji (1×)
Wenyuan Jiang (1×)
Yudong Gao (1×)
Xunguang Wang (1×)
Yuguang Zhou (1×)

Research Timeline

2026
Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models

This paper introduces a novel framework, the Reasoning Safety Monitor, to detect and prevent logical inconsistencies and adversarial manipulations within the internal reasoning steps of large language models, establishing reasoning safety as a critical security dimension.

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode

The paper independently stress-tests the permission system of Claude Code's auto mode using a deliberately ambiguous benchmark, finding that its actual false-negative rate is significantly higher than reported, particularly because file edits are not monitored.

Papers

cs.SE, cs.AI, cs.CR • Recent • Apr 4, 2026

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode

Zimo Ji, Zongjie Li, Wenyuan Jiang, Yudong Gao +1 more

cs.AI, cs.CR • Recent • Mar 26, 2026

Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models

Xunguang Wang, Yuguang Zhou, Qingyue Wang, Zongjie Li +4 more
