Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Yuan Chen

Yuan Chen

13 indexed papers

Recent (6 mo)
13
With code
0
Influential cites
0
Benchmarked
0

Publications per year

13
26

Top categories

AI×7Crypto×7ML×2Comp. Eng.×2Software Eng.×2Sound×1NLP×1Robotics×1

Frequent co-authors

Boyuan Chen3×
Xipeng Qiu2×
Shengchen Ling2×
Yihang Huang2×
Yajin Zhou2×
Lei Wu2×

Research Timeline

2026
Do Privacy Policies Match with the Logs? An Empirical Study of Privacy Disclosure in Android Application Logs

This study empirically analyzed 1,000 Android apps, finding that privacy policies are often vague and frequently fail to align with the actual sensitive data logged by the applications.

RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

The paper introduces RAVEN, a Retrieval-Augmented Vulnerability Exploration Network, which uses LLM agents and RAG to automatically generate comprehensive, structured vulnerability analysis reports for vulnerable code.

Semantics-Based Verification of an Implemented Shor Oracle for ECDLP in Qrisp

The paper introduces a semantics-first verification framework for an implemented Shor oracle for ECDLP in Qrisp, demonstrating that even seemingly correct implementations can fail due to subtle control law violations.

Pop Quiz Attack: Black-box Membership Inference Attacks Against Large Language Models

The PopQuiz Attack is a novel black-box membership inference attack that successfully tests whether large language models memorize specific training data by framing the target data as multiple-choice quiz questions.

RADAR: Defending RAG Dynamically against Retrieval Corruption

The paper proposes RADAR, a novel graph-based framework that dynamically defends Retrieval-Augmented Generation (RAG) systems against evolving adversarial attacks while minimizing storage overhead.

SafeMed-R1: Clinician-Audited Safety and Ethics Alignment for Medical Large Language Models

The paper introduces SafeMed-R1, a clinically audited LLM that significantly improves safety and ethical alignment for medical applications, matching or exceeding resident performance on safety-critical tasks.

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation

The paper proposes Agentic ASR, a closed-loop framework that treats ASR as a multi-turn refinement task, significantly improving semantic accuracy over traditional token-level metrics.

MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models

The paper introduces MiraBench, a new benchmark that evaluates the action-conditioned reliability of robotic world models, finding that visual fidelity is insufficient and that optimism bias is a pervasive issue across current systems.

Extreme dynamic symmetry enables omnidirectional and multifunctional robots

The paper introduces and demonstrates that leveraging dynamic symmetry—the uniformity of attainable center-of-mass accelerations—significantly enhances a robot's agility, robustness, and multifunctionality across various challenging environments.

Free-Riding in the AI Economy: Demystifying Logic Flaws in x402-Enabled Payment Systems

This paper analyzes the x402 payment protocol, revealing critical synchronization and security flaws that allow attackers to exploit payment systems and force merchants to subsidize compute costs.

Free-Riding in the AI Economy: Demystifying Logic Flaws in x402-Enabled Payment Systems

This paper analyzes the x402 payment protocol, revealing systemic vulnerabilities in state synchronization and signature design that allow attackers to exploit payment systems for resource leakage in the AI economy.

SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training

SIRI introduces a self-internalizing reinforcement learning framework that allows LLM agents to autonomously discover and integrate reusable skills directly into their core policy, significantly improving performance on complex tasks without external skill generators.

MOSS-Audio Technical Report

MOSS-Audio is a unified audio-language model designed for comprehensive understanding of speech, environmental sounds, and music, achieving strong performance across various audio-grounded tasks.

Highlighted terms show continued research focus across papers

Papers

cs.AIcs.LGRecentJun 1, 2026

SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training

Zhongyu He, Yuanfan Li, Fei Huang, Tianyu Chen +8 more

SIRI introduces a self-internalizing reinforcement learning framework that allows LLM agents to autonomously discover and integrate reusable skills directly into their core policy, significantly impro…

View →
cs.SDcs.AIRecentJun 1, 2026

MOSS-Audio Technical Report

Chen Yang, Chufan Yu, Hanfu Chen, Jie Zhu +21 more

MOSS-Audio is a unified audio-language model designed for comprehensive understanding of speech, environmental sounds, and music, achieving strong performance across various audio-grounded tasks.

View →
cs.CRcs.CERecentMay 29, 2026

Free-Riding in the AI Economy: Demystifying Logic Flaws in x402-Enabled Payment Systems

Shengchen Ling, Yihang Huang, Yuan Chen, Yajin Zhou +2 more

This paper analyzes the x402 payment protocol, revealing critical synchronization and security flaws that allow attackers to exploit payment systems and force merchants to subsidize compute costs.

View →
cs.CRcs.CERecentMay 29, 2026

Free-Riding in the AI Economy: Demystifying Logic Flaws in x402-Enabled Payment Systems

Shengchen Ling, Yihang Huang, Yuan Chen, Yajin Zhou +2 more

This paper analyzes the x402 payment protocol, revealing systemic vulnerabilities in state synchronization and signature design that allow attackers to exploit payment systems for resource leakage in…

View →
cs.AIcs.CLRecentMay 28, 2026

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation

Zixuan Jiang, Yanqiao Zhu, Peng Wang, Qinyuan Chen +7 more

The paper proposes Agentic ASR, a closed-loop framework that treats ASR as a multi-turn refinement task, significantly improving semantic accuracy over traditional token-level metrics.

View →
cs.AIRecentMay 28, 2026

MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models

Tianzhuo Yang, Zihan Shen, Zirui Mi, Zhaoyi Zhang +6 more

The paper introduces MiraBench, a new benchmark that evaluates the action-conditioned reliability of robotic world models, finding that visual fidelity is insufficient and that optimism bias is a perv…

View →
cs.ROcs.AIRecentMay 28, 2026

Extreme dynamic symmetry enables omnidirectional and multifunctional robots

Jiaxun Liu, Boxi Xia, Boyuan Chen

The paper introduces and demonstrates that leveraging dynamic symmetry—the uniformity of attainable center-of-mass accelerations—significantly enhances a robot's agility, robustness, and multifunction…

View →
cs.AIRecentMay 27, 2026

SafeMed-R1: Clinician-Audited Safety and Ethics Alignment for Medical Large Language Models

Chao Ding, Mouxiao Bian, Tianbin Li, Minjia Yuan +11 more

The paper introduces SafeMed-R1, a clinically audited LLM that significantly improves safety and ethical alignment for medical applications, matching or exceeding resident performance on safety-critic…

View →
cs.CRcs.LGRecentMay 21, 2026

RADAR: Defending RAG Dynamically against Retrieval Corruption

Ziyuan Chen, Yueming Lyu, Yi Liu, Weixiang Han +3 more

The paper proposes RADAR, a novel graph-based framework that dynamically defends Retrieval-Augmented Generation (RAG) systems against evolving adversarial attacks while minimizing storage overhead.

View →
cs.CRRecentMay 7, 2026

Pop Quiz Attack: Black-box Membership Inference Attacks Against Large Language Models

Zeyuan Chen, Yihan Ma, Xinyue Shen, Michael Backes +1 more

The PopQuiz Attack is a novel black-box membership inference attack that successfully tests whether large language models memorize specific training data by framing the target data as multiple-choice…

View →
cs.SEcs.CRquant-phRecentMay 1, 2026

Semantics-Based Verification of an Implemented Shor Oracle for ECDLP in Qrisp

Lei Zhang, Zhiyuan Chen

The paper introduces a semantics-first verification framework for an implemented Shor oracle for ECDLP in Qrisp, demonstrating that even seemingly correct implementations can fail due to subtle contro…

View →
cs.CRcs.SERecentApr 20, 2026

Do Privacy Policies Match with the Logs? An Empirical Study of Privacy Disclosure in Android Application Logs

Zhiyuan Chen, Love Jayesh Ahir, Ahmad Suleiman, Kundi Yao +3 more

This study empirically analyzed 1,000 Android apps, finding that privacy policies are often vague and frequently fail to align with the actual sensitive data logged by the applications.

View →
cs.CRcs.AIcs.MARecentApr 20, 2026

RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

Parteek Jamwal, Minghao Shao, Boyuan Chen, Achyuta Muthuvelan +14 more

The paper introduces RAVEN, a Retrieval-Augmented Vulnerability Exploration Network, which uses LLM agents and RAG to automatically generate comprehensive, structured vulnerability analysis reports fo…

View →