Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Yuan

Yuan

50 indexed papers

Recent (6 mo)
50
With code
0
Influential cites
0
Benchmarked
0

Publications per year

50
26

Top categories

AI×28ML×16Crypto×12NLP×11Vision×10Robotics×4Info Retrieval×3Distributed×3

Frequent co-authors

Han Li3×
Pengyuan Liu3×
Yufei Ye2×
Hao Peng2×
Jinghuai Zhang2×
Kunlin Cai2×

Research Timeline

2026
Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

The paper proposes Skill-RM, a unified framework that treats reward modeling as an agentic task to consistently integrate diverse evaluation criteria, achieving superior performance over traditional methods.

VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring

VLESA is a novel framework that monitors human activities from egocentric video to predict and intervene in dangerous actions by incorporating goal-conditioned safety checks based on inferred intent.

Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

The paper introduces a Contextual Integrity (CI) framework and a new benchmark (DelegateCI-Bench) to rewrite user queries sent to cloud LLMs, ensuring only task-essential information is retained while preserving utility and maximizing privacy.

NeuroArmor: Safe-Variant-Guided Representation Consistency for Selective Re-Anchoring in Jailbreak Defense

NeuroArmor is a white-box runtime defense that uses prompt-specific safe variants to selectively detect and mitigate jailbreak attacks, significantly reducing attack success rates while maintaining a low false positive rate.

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

The paper introduces HERALD, a token-level cryptographic redaction framework that encrypts only sensitive tokens in clinical text, enabling privacy-preserving LLM deployment without significant loss of utility.

ImageAuditor: Membership Inference Attack against Image-based Retrieval-Augmented Generation

ImageAuditor introduces a novel Membership Inference Attack (MIA) specifically designed for Image-based Retrieval-Augmented Generation (IRAG) systems, achieving high accuracy by addressing cross-modal retrieval and discriminative signal extraction challenges.

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

This paper introduces CHERRL, a controllable hacking environment for rubric-based reinforcement learning to study and mitigate reward hacking.

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

This paper presents GRAIL, a digital generation pipeline that synthesizes human-object interactions for humanoid robots.

Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

The paper proposes a novel method using fully homomorphic encryption (FHE) to learn causal structures while preserving data privacy, achieving high consistency and practical efficiency.

What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems

The paper introduces and analyzes cross-session stored prompt injection, demonstrating that persistent system state transforms prompt injection from a temporary model-level threat into a long-lived, system-level vulnerability in agentic systems.

Pepper: High-bandwidth and Scalable Anonymous Broadcast with Cryptographic Privacy

Pepper is a novel, high-bandwidth anonymous broadcast protocol that achieves cryptographic sender anonymity and significantly improves messaging throughput compared to existing state-of-the-art systems.

DPDL: Towards Differential Privacy Preservation in Decentralized Stochastic Learning on Non-IID Data

The paper proposes DPDL, a novel differential privacy algorithm for decentralized stochastic learning on non-IID data, which uses similarity-based calibration of perturbed cross-gradients to achieve privacy preservation and maintain training efficiency.

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

MLEvolve is a novel self-evolving multi-agent framework that enables LLM agents to discover and optimize machine learning algorithms for complex, long-horizon tasks.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

Bridging the Semantic-Collaborative Gap: An Asymmetric Graph Architecture for Cold-Start Item Recommendation

The paper proposes Shallow-RHS, an asymmetric graph-completion model, to solve the cold-start problem for both new content and new devices in large-scale recommendation systems.

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

The paper proposes an iCEM+TL framework that combines the Sample-efficient Cross-Entropy Method with Transfer Learning and Reward Redesign to improve robotic motion planning for complex tasks like stacking and shelf placement.

Ensuring Interaction Safety in Multitask Exoskeleton Control: A Simulation-Trained Variable Impedance Framework

The paper proposes a simulation-trained variable impedance control framework for wearable exoskeletons that safely and effectively augments human physical capabilities across multiple tasks.

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

This paper proposes a training-free framework called ReasonAlloc to mitigate inference bottlenecks in large language models by recasting decoding-time key-value compression as a hierarchical budget allocation problem.

Reconfigurable Antennas for Next-generation Mobile Communication Networks: A Comprehensive Survey and Tutorial

This paper presents a comprehensive survey on reconfigurable antennas for next-generation mobile networks, focusing on their potential and applications.

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

肖代替了视觉令牌的永久删除,通过可恢复的路由来改进视觉语言模型的性能

Highlighted terms show continued research focus across papers

Papers

cs.ITSurveyRecentJun 10, 2026

Reconfigurable Antennas for Next-generation Mobile Communication Networks: A Comprehensive Survey and Tutorial

Yizhe Zhao, Long Zhang, Halvin Yang, Kun Yang +3 more

This paper presents a comprehensive survey on reconfigurable antennas for next-generation mobile networks, focusing on their potential and applications.

View →
cs.CVcs.AIEmpirical
Recent
Jun 10, 2026

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

Cheng-Yu Yang, Shao-Yuan Lo, Yu-Lun Liu

肖代替了视觉令牌的永久删除,通过可恢复的路由来改进视觉语言模型的性能

View →
cs.AIEmpiricalRecentJun 9, 2026

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

Wenhao Liu, Hao Shi, Yunhe Li, Weizhi Fei +6 more

This paper proposes a training-free framework called ReasonAlloc to mitigate inference bottlenecks in large language models by recasting decoding-time key-value compression as a hierarchical budget al…

View →
cs.AIcs.CLRecentJun 4, 2026

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

Shangheng Du, Xiangchao Yan, Jinxin Shi, Zongsheng Cao +10 more

MLEvolve is a novel self-evolving multi-agent framework that enables LLM agents to discover and optimize machine learning algorithms for complex, long-horizon tasks.

View →
cs.IRcs.AIcs.CLRecentJun 4, 2026

OneReason Technical Report

OneRec Team, Biao Yang, Boyang Ding, Chenglong Chu +80 more

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coheren…

View →
cs.IRcs.AIcs.LGRecentJun 4, 2026

Bridging the Semantic-Collaborative Gap: An Asymmetric Graph Architecture for Cold-Start Item Recommendation

Anh Truong, John Trenkle, Yuanbo Chen, Honghong Zhao +3 more

The paper proposes Shallow-RHS, an asymmetric graph-completion model, to solve the cold-start problem for both new content and new devices in large-scale recommendation systems.

View →
cs.ROcs.AIcs.NERecentJun 4, 2026

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

Yuanzhi He, Victor Romero-Cano, José J. Patiño, Juan David Hernández +2 more

The paper proposes an iCEM+TL framework that combines the Sample-efficient Cross-Entropy Method with Transfer Learning and Reward Redesign to improve robotic motion planning for complex tasks like sta…

View →
cs.RORecentJun 4, 2026

Ensuring Interaction Safety in Multitask Exoskeleton Control: A Simulation-Trained Variable Impedance Framework

Muyuan Ma, Houcheng Li, Haotian Zhai, Lijun Han +3 more

The paper proposes a simulation-trained variable impedance control framework for wearable exoskeletons that safely and effectively augments human physical capabilities across multiple tasks.

View →
cs.LGcs.AIcs.CLRecentJun 3, 2026

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Xuekang Wang, Zhuoyuan Hao, Shuo Hou, Hao Peng +2 more

This paper introduces CHERRL, a controllable hacking environment for rubric-based reinforcement learning to study and mitigate reward hacking.

View →
cs.RORecentJun 3, 2026

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

Tianyi Xie, Haotian Zhang, Jinhyung Park, Zi Wang +16 more

This paper presents GRAIL, a digital generation pipeline that synthesizes human-object interactions for humanoid robots.

View →
cs.CRcs.LGRecentJun 3, 2026

Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

Jian Yang, Yuan Tong, Qinbin Li, Zeyi Wen +1 more

The paper proposes a novel method using fully homomorphic encryption (FHE) to learn causal structures while preserving data privacy, achieving high consistency and practical efficiency.

View →
cs.CRcs.AIRecentJun 3, 2026

What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems

Yuanbo Xie, Tianyun Liu, Yingjie Zhang, Suchen Liu +3 more

The paper introduces and analyzes cross-session stored prompt injection, demonstrating that persistent system state transforms prompt injection from a temporary model-level threat into a long-lived, s…

View →
cs.CRRecentJun 3, 2026

Pepper: High-bandwidth and Scalable Anonymous Broadcast with Cryptographic Privacy

Chenghao Li, Haoyuan Wang, Xianghang Mi

Pepper is a novel, high-bandwidth anonymous broadcast protocol that achieves cryptographic sender anonymity and significantly improves messaging throughput compared to existing state-of-the-art system…

View →
cs.LGcs.CRRecentJun 3, 2026

DPDL: Towards Differential Privacy Preservation in Decentralized Stochastic Learning on Non-IID Data

Yunsheng Yuan, Xue Xiao, Lina Wang, Feng Li

The paper proposes DPDL, a novel differential privacy algorithm for decentralized stochastic learning on non-IID data, which uses similarity-based calibration of perturbed cross-gradients to achieve p…

View →
cs.LGcs.CLRecentJun 2, 2026

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Tao Chen, Gangwei Jiang, Pengyu Cheng, Siyuan Huang +9 more

The paper proposes Skill-RM, a unified framework that treats reward modeling as an agentic task to consistently integrate diverse evaluation criteria, achieving superior performance over traditional m…

View →
cs.CVcs.LGcs.RORecentJun 2, 2026

VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring

Hanjiang Hu, Yiyuan Pan, Jiaxing Li, Xusheng Luo +4 more

VLESA is a novel framework that monitors human activities from egocentric video to predict and intervene in dangerous actions by incorporating goal-conditioned safety checks based on inferred intent.

View →
cs.CRcs.AIRecentJun 2, 2026

Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

Xinyue Huang, Xiaochun Cao, Wenyuan Yang

The paper introduces a Contextual Integrity (CI) framework and a new benchmark (DelegateCI-Bench) to rewrite user queries sent to cloud LLMs, ensuring only task-essential information is retained while…

View →
cs.CRcs.AIRecentJun 2, 2026

NeuroArmor: Safe-Variant-Guided Representation Consistency for Selective Re-Anchoring in Jailbreak Defense

Zhongyang Lin, Ziran Zhao, Feifei Zhai, Pengyuan Liu

NeuroArmor is a white-box runtime defense that uses prompt-specific safe variants to selectively detect and mitigate jailbreak attacks, significantly reducing attack success rates while maintaining a…

View →
cs.CLcs.CRRecentJun 2, 2026

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

Farhan Sheth, Ziyuan Yang, Yongying Lan, Si Yong Yeo

The paper introduces HERALD, a token-level cryptographic redaction framework that encrypts only sensitive tokens in clinical text, enabling privacy-preserving LLM deployment without significant loss o…

View →
cs.CRRecentJun 2, 2026

ImageAuditor: Membership Inference Attack against Image-based Retrieval-Augmented Generation

Jinghuai Zhang, Pengyue Yu, Zhexiao Lin, Kunlin Cai +2 more

ImageAuditor introduces a novel Membership Inference Attack (MIA) specifically designed for Image-based Retrieval-Augmented Generation (IRAG) systems, achieving high accuracy by addressing cross-modal…

View →