Qi Zhu

9 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×8Crypto×5ML×3NLP×1Info Retrieval×1Vision×1

Frequent co-authors

Daoqiang Zhang3×

Fei Mi2×

Hongning Wang2×

Minlie Huang2×

Yinbo Yu2×

Jing Fang2×

Research Timeline

2026

LoopTrap: Termination Poisoning Attacks on LLM Agents

The paper introduces LoopTrap, an automated red-teaming framework that demonstrates how malicious prompts can poison the termination judgment of LLM agents, causing unbounded computation.

Fast and Lightweight Backdoor Detection via Head Random Probing

The paper proposes HTell, a fast and lightweight data-free backdoor detector that analyzes the abnormal response concentration of backdoored models on the target class using random latent probes applied directly to the prediction head.

Lightweight and Fast Backdoor Model Detection

The paper proposes DFBScanner, a lightweight static parameter inspection framework that detects backdoor attacks by analyzing anomalous parameter updates in the final classification layer, achieving fast and generalizable detection.

You Live More Than Once: Towards Hierarchical Skill Meta-Evolving

The paper proposes HiSME, a lightweight hierarchical skill meta-evolving solution that jointly optimizes skills and the skill evolving strategy by learning meta-skills from task execution traces, leading to improved agent performance.

Exploring Autonomous Agentic Data Engineering for Model Specialization

The paper introduces Autonomous Agentic Data Engineering, demonstrating that LLMs can autonomously plan and optimize end-to-end data curation pipelines, leading to substantial performance gains in specialized models.

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

VISUALTHINK-VLA introduces a visual intermediate-reasoning framework that guides action prediction using compact visual evidence, achieving high accuracy and significantly low latency for real-time Vision-Language-Action policies.

When AI Meets Wall Street: A Survey on Trustworthy AI in Fintech

This paper proposes a unified, lifecycle-centric framework and a detailed taxonomy to survey and analyze novel, finance-specific attack surfaces and vulnerabilities in AI systems used within the financial sector.

Physically-Constrained Mamba-SDE for Remaining Useful Life Prediction under Irregular Observations

The paper proposes PC-MambaSDE, a physically-constrained continuous-time framework that accurately predicts Remaining Useful Life (RUL) despite irregular sensor observations and ensures physically plausible degradation trajectories.

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

The paper introduces RUBAS, a rubric-based reinforcement learning framework that improves agent safety by providing fine-grained, multi-dimensional rewards for complex tool-use scenarios.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIcs.CRRecentJun 2, 2026

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

Xian Qi Loye, Qinglin Su, Zhexin Zhang, Shiyao Cui +4 more

The paper introduces RUBAS, a rubric-based reinforcement learning framework that improves agent safety by providing fine-grained, multi-dimensional rewards for complex tool-use scenarios.

View →

cs.AIRecentJun 1, 2026