Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Zikai Zhang

Zikai Zhang

2 indexed papers

Recent (6 mo)
2
With code
0
Influential cites
0
Benchmarked
0

Publications per year

2
26

Top categories

AI×2Crypto×2NLP×1

Frequent co-authors

Jiahao Xu2×
Rui Hu2×
Olivera Kotevska2×

Research Timeline

2026
SelfGrader: LLM Jailbreak Detection via Anchored Token-Level Logits

SelfGrader proposes a lightweight, robust guardrail for detecting LLM jailbreaks by formulating the detection problem as a numerical grading task using anchored token-level logits, achieving strong performance across various benchmarks.

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

XMark introduces a novel multi-bit watermarking technique that reliably embeds binary messages into LLM-generated text while maintaining high text quality and robust performance even with limited token context.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIcs.CRRecentApr 6, 2026

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

Jiahao Xu, Rui Hu, Olivera Kotevska, Zikai Zhang

XMark introduces a novel multi-bit watermarking technique that reliably embeds binary messages into LLM-generated text while maintaining high text quality and robust performance even with limited toke…

View →
cs.CRcs.AIRecentApr 1, 2026

SelfGrader: LLM Jailbreak Detection via Anchored Token-Level Logits

Zikai Zhang, Rui Hu, Olivera Kotevska, Jiahao Xu

SelfGrader proposes a lightweight, robust guardrail for detecting LLM jailbreaks by formulating the detection problem as a numerical grading task using anchored token-level logits, achieving strong pe…

View →