Yuan

50 indexed papers

Top categories

AI ×28, ML ×16, Crypto ×12, NLP ×11, Vision ×10, Robotics ×4, Info Retrieval ×3, Distributed ×3

Frequent co-authors

Han Li (3×)

Pengyuan Liu (3×)

Yufei Ye (2×)

Hao Peng (2×)

Jinghuai Zhang (2×)

Kunlin Cai (2×)

Research Timeline

2026

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

The paper proposes Skill-RM, a unified framework that treats reward modeling as an agentic task to consistently integrate diverse evaluation criteria, achieving superior performance over traditional methods.

VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring

VLESA is a novel framework that monitors human activities from egocentric video to predict and intervene in dangerous actions by incorporating goal-conditioned safety checks based on inferred intent.

Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

The paper introduces a Contextual Integrity (CI) framework and a new benchmark (DelegateCI-Bench) to rewrite user queries sent to cloud LLMs, ensuring only task-essential information is retained while preserving utility and maximizing privacy.

NeuroArmor: Safe-Variant-Guided Representation Consistency for Selective Re-Anchoring in Jailbreak Defense

NeuroArmor is a white-box runtime defense that uses prompt-specific safe variants to selectively detect and mitigate jailbreak attacks, significantly reducing attack success rates while maintaining a low false positive rate.

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

The paper introduces HERALD, a token-level cryptographic redaction framework that encrypts only sensitive tokens in clinical text, enabling privacy-preserving LLM deployment without significant loss of utility.

ImageAuditor: Membership Inference Attack against Image-based Retrieval-Augmented Generation

ImageAuditor introduces a novel Membership Inference Attack (MIA) specifically designed for Image-based Retrieval-Augmented Generation (IRAG) systems, achieving high accuracy by addressing cross-modal retrieval and discriminative signal extraction challenges.

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

This paper introduces CHERRL, a controllable hacking environment for rubric-based reinforcement learning to study and mitigate reward hacking.

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

This paper presents GRAIL, a digital generation pipeline that synthesizes human-object interactions for humanoid robots.

Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

The paper proposes a novel method using fully homomorphic encryption (FHE) to learn causal structures while preserving data privacy, achieving high consistency and practical efficiency.

What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems

The paper introduces and analyzes cross-session stored prompt injection, demonstrating that persistent system state transforms prompt injection from a temporary model-level threat into a long-lived, system-level vulnerability in agentic systems.

Pepper: High-bandwidth and Scalable Anonymous Broadcast with Cryptographic Privacy

Pepper is a novel, high-bandwidth anonymous broadcast protocol that achieves cryptographic sender anonymity and significantly improves messaging throughput compared to existing state-of-the-art systems.

DPDL: Towards Differential Privacy Preservation in Decentralized Stochastic Learning on Non-IID Data

The paper proposes DPDL, a novel differential privacy algorithm for decentralized stochastic learning on non-IID data, which uses similarity-based calibration of perturbed cross-gradients to achieve privacy preservation and maintain training efficiency.

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

MLEvolve is a novel self-evolving multi-agent framework that enables LLM agents to discover and optimize machine learning algorithms for complex, long-horizon tasks.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

Bridging the Semantic-Collaborative Gap: An Asymmetric Graph Architecture for Cold-Start Item Recommendation

The paper proposes Shallow-RHS, an asymmetric graph-completion model, to solve the cold-start problem for both new content and new devices in large-scale recommendation systems.

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

The paper proposes an iCEM+TL framework that combines the Sample-efficient Cross-Entropy Method with Transfer Learning and Reward Redesign to improve robotic motion planning for complex tasks like stacking and shelf placement.

Ensuring Interaction Safety in Multitask Exoskeleton Control: A Simulation-Trained Variable Impedance Framework

The paper proposes a simulation-trained variable impedance control framework for wearable exoskeletons that safely and effectively augments human physical capabilities across multiple tasks.

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

This paper proposes a training-free framework called ReasonAlloc to mitigate inference bottlenecks in large language models by recasting decoding-time key-value compression as a hierarchical budget allocation problem.

Reconfigurable Antennas for Next-generation Mobile Communication Networks: A Comprehensive Survey and Tutorial

This paper presents a comprehensive survey on reconfigurable antennas for next-generation mobile networks, focusing on their potential and applications.

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

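Rather than permanently removing visual tokens, the paper uses recoverable routing to improve the performance of vision-language models.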

Papers

cs.IT, Survey, Recent, Jun 10, 2026

Reconfigurable Antennas for Next-generation Mobile Communication Networks: A Comprehensive Survey and Tutorial

Yizhe Zhao, Long Zhang, Halvin Yang, Kun Yang +3 more

This paper presents a comprehensive survey on reconfigurable antennas for next-generation mobile networks, focusing on their potential and applications.

cs.CV, cs.AI, Empirical