Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Rui Hu

Rui Hu

4 indexed papers

Recent (6 mo)
4
With code
0
Influential cites
0
Benchmarked
0

Publications per year

4
26

Top categories

AI×4NLP×3Crypto×2HCI×1ML×1

Frequent co-authors

Jiahao Xu2×
Olivera Kotevska2×
Zikai Zhang2×
Jun Rui Huang1×
Wang Bill Zhu1×
Ziyi Liu1×

Research Timeline

2026
SelfGrader: LLM Jailbreak Detection via Anchored Token-Level Logits

SelfGrader proposes a lightweight, robust guardrail for detecting LLM jailbreaks by formulating the detection problem as a numerical grading task using anchored token-level logits, achieving strong performance across various benchmarks.

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

XMark introduces a novel multi-bit watermarking technique that reliably embeds binary messages into LLM-generated text while maintaining high text quality and robust performance even with limited token context.

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

The paper introduces MemTrace, a framework that treats LLM memory pipelines as traceable graphs to systematically diagnose and automatically correct memory-related errors, boosting performance by up to 7.62%.

EUDAIMONIA: Evaluating Undesirable Dynamics in AI

The paper introduces EUDAIMONIA, a new framework and benchmark for evaluating how well LLMs align with user welfare in social interactions, finding that even state-of-the-art models frequently violate social-alignment requirements.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIcs.HCRecentMay 28, 2026

EUDAIMONIA: Evaluating Undesirable Dynamics in AI

Jun Rui Huang, Wang Bill Zhu, Ziyi Liu, Nathanael Fast +2 more

The paper introduces EUDAIMONIA, a new framework and benchmark for evaluating how well LLMs align with user welfare in social interactions, finding that even state-of-the-art models frequently violate…

View →
cs.CLcs.AIcs.LGRecentMay 27, 2026

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

Xinle Deng, Ruobin Zhong, Hujin Peng, Xiaoben Lu +14 more

The paper introduces MemTrace, a framework that treats LLM memory pipelines as traceable graphs to systematically diagnose and automatically correct memory-related errors, boosting performance by up t…

View →
cs.CLcs.AIcs.CRRecentApr 6, 2026

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

Jiahao Xu, Rui Hu, Olivera Kotevska, Zikai Zhang

XMark introduces a novel multi-bit watermarking technique that reliably embeds binary messages into LLM-generated text while maintaining high text quality and robust performance even with limited toke…

View →
cs.CRcs.AIRecentApr 1, 2026

SelfGrader: LLM Jailbreak Detection via Anchored Token-Level Logits

Zikai Zhang, Rui Hu, Olivera Kotevska, Jiahao Xu

SelfGrader proposes a lightweight, robust guardrail for detecting LLM jailbreaks by formulating the detection problem as a numerical grading task using anchored token-level logits, achieving strong pe…

View →