Rui Hu

4 indexed papers

Top categories

AI ×4, NLP ×3, Crypto ×2, HCI ×1, ML ×1

Frequent co-authors

Jiahao Xu (2×)

Olivera Kotevska (2×)

Zikai Zhang (2×)

Jun Rui Huang (1×)

Wang Bill Zhu (1×)

Ziyi Liu (1×)

Research Timeline

2026

SelfGrader: LLM Jailbreak Detection via Anchored Token-Level Logits

SelfGrader is a lightweight, robust guardrail that detects LLM jailbreaks by framing detection as a numerical grading task over anchored token-level logits; it reports strong performance across several benchmarks.

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

XMark introduces a novel multi-bit watermarking technique that reliably embeds binary messages into LLM-generated text while maintaining high text quality and robust performance even with limited token context.

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

The paper introduces MemTrace, a framework that treats LLM memory pipelines as traceable graphs to systematically diagnose and automatically correct memory-related errors, boosting performance by up to 7.62%.

EUDAIMONIA: Evaluating Undesirable Dynamics in AI

The paper introduces EUDAIMONIA, a new framework and benchmark for evaluating how well LLMs align with user welfare in social interactions, finding that even state-of-the-art models frequently violate social-alignment requirements.


Papers

cs.CL · cs.AI · cs.HC · Recent · May 28, 2026

EUDAIMONIA: Evaluating Undesirable Dynamics in AI

Jun Rui Huang, Wang Bill Zhu, Ziyi Liu, Nathanael Fast +2 more


cs.CL · cs.AI · cs.LG · Recent · May 27, 2026