Kai Zhang

11 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×8Crypto×5Robotics×2Vision×2Distributed×1Networking×1Systems and Control×1Multiagent×1

Frequent co-authors

Jingkai Zhang2×

Jiahao Xu2×

Rui Hu2×

Olivera Kotevska2×

Zikai Zhang2×

Mingxin Li1×

Research Timeline

2026

SelfGrader: LLM Jailbreak Detection via Anchored Token-Level Logits

SelfGrader proposes a lightweight, robust guardrail for detecting LLM jailbreaks by formulating the detection problem as a numerical grading task using anchored token-level logits, achieving strong performance across various benchmarks.

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

XMark introduces a novel multi-bit watermarking technique that reliably embeds binary messages into LLM-generated text while maintaining high text quality and robust performance even with limited token context.

ClawGuard: Out-of-Band Detection of LLM Agent Workflow Hijacking via EM Side Channel

ClawGuard introduces a passive, out-of-band security monitor that detects LLM agent workflow hijacking by analyzing unique electromagnetic (EM) emanations generated during agent skill execution.

MT-JailBench: A Modular Benchmark for Understanding Multi-Turn Jailbreak Attacks

The paper introduces MT-JailBench, a modular framework for evaluating multi-turn jailbreaks, demonstrating that controlling experimental components like prompt generation and resource budgets is crucial for fair comparison and understanding attack success.

From Fact Overwriting to Knowledge Evolution: Causal Editing via On-Policy Self-Distillation

The paper introduces Causal Editing (CODE), a new paradigm that improves knowledge updates in LLMs by grounding fact injection in causal narratives, drastically reducing self-refutation rates.

CardioLens: Revealing the Clinical Reality Gap of MLLMs via Multi-Sequence Cardiac MRI Evaluations

The paper introduces CardioLens, a rigorous evaluation testbed for multi-sequence Cardiac MRI, which reveals that current Multimodal Large Language Models (MLLMs) exhibit a significant 'clinical reality gap' and perform poorly when simulating real-world cardiac interpretation workflows.

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

The paper introduces Humanoid-GPT, a large-scale generative Transformer model that achieves robust zero-shot motion tracking and control by training on a massive, unified corpus of motion data.

From Agent Traces to Trust: Evidence Tracing and Execution Provenance in LLM Agents

This survey provides a systematic framework and taxonomy for evidence tracing and execution provenance in LLM agents, addressing the difficulty of verifying and auditing complex agent behaviors.

Welfarist Control Design -- How to fulfill the societal mandate in multi-agent control?

This paper explores tools for control engineers to design socio-technical systems in a more principled and ethical manner, using feedback optimization, control of Markov decision processes, and model predictive control.

Chronos: A Physics-Informed Full-History Framework for Non-Markovian Long-Horizon Manipulation

This paper introduces Chronos, a physics-informed framework for non-Markovian long-horizon manipulation, which elevates observation history to the latent state of the policy dynamics and achieves higher success rates and fewer parameters than Markovian VLA baselines in both simulated and real-world experiments.

Scalable LLM Agent Tool Access in the Cloud

A cloud-scale gateway system for MCP services is presented, which breaks the direct-connect model and offloads legacy service integration, consolidates incompatible MCP variants, and reduces tool selection time and token usage.

Highlighted terms show continued research focus across papers

Papers

cs.DCcs.AIcs.NIEmpiricalRecentJul 17, 2026

Scalable LLM Agent Tool Access in the Cloud

Mingxin Li, Enge Song, Yueshang Zuo, Xiaodong Liu +26 more

View →

cs.ROEmpirical