Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Yan Li

Yan Li

19 indexed papers

Recent (6 mo)
19
With code
0
Influential cites
0
Benchmarked
0

Publications per year

19
26

Top categories

AI×14Crypto×9NLP×3ML×3Vision×2Info Retrieval×1Robotics×1Multiagent×1

Frequent co-authors

Ziyan Liu3×
Shiyan Liu2×
Zhezheng Hao2×
Hong Wang2×
Yan Lin Aung2×
Wei Yang Bryan Lim2×

Research Timeline

2026
Estimating the Social Cost of Corporate Data Breaches

This study estimates the true social cost of corporate data breaches by quantifying the direct financial and opportunity costs to victims, finding that these costs can significantly exceed corporate settlements, though the marginal social cost per victim appears to be declining over time.

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

This paper conducts the first real-world safety evaluation of the personal AI agent OpenClaw, demonstrating that its broad system access creates inherent vulnerabilities that significantly increase the attack success rate regardless of the underlying large language model.

E-MIA: Exam-Style Black-Box Membership Inference Attacks against RAG Systems

E-MIA introduces a novel, stealthy black-box membership inference attack that converts verifiable hard evidence within a candidate document into an objective, multi-part exam score to determine if the document was ingested into a RAG knowledge base.

Tracing the Dynamics of Refusal: Exploiting Latent Refusal Trajectories for Robust Jailbreak Detection

The paper proposes SALO, a novel detector that monitors the dynamic, layer-wise activation pattern (Refusal Trajectory) to improve jailbreak detection robustness compared to traditional methods relying on static terminal representations.

FraudBench: A Multimodal Benchmark for Detecting AI-Generated Fraudulent Refund Evidence

The paper introduces FraudBench, a multimodal benchmark designed to detect AI-generated fraudulent refund evidence, finding that current AI models struggle significantly with claim-conditioned fake-damage detection.

From Compression to Accountability: Harmless Copyright Protection for Dataset Distillation

The paper proposes SubPopMark, a novel subpopulation-driven framework that injects harmless, verifiable markers into distilled datasets to prevent copyright infringement and data leakage.

LymphNode: A Plug-and-Play Access Control Method for Deep Neural Networks

LymphNode is a novel, post-hoc access control framework that protects Deep Neural Networks (DNNs) from model extraction and inversion attacks by enforcing a default-deny policy and selectively restoring utility only for authorized queries.

BYOT-CPS: A Hybrid Cyber-Physical Systems Testbed for IoT Security Assessment and Platform Evaluation

The paper introduces BYOT-CPS, a hybrid cyber-physical testbed that bridges the gap between purely simulated and purely physical IoT testing environments, enabling realistic and scalable security assessment.

CyBOKClaw: Human-in-the-Loop CyBOK Mapping for Cybersecurity Curriculum

CyBOKClaw is an interpretable human-in-the-loop retrieval framework designed to map broad cybersecurity keywords to the Cyber Security Body of Knowledge (CyBOK), achieving high expert-guided mapping accuracy on both development and validation datasets.

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

The paper introduces Metacognitive Memory Policy Optimization (MMPO), a novel memory training approach that optimizes LLM memory not based on final task success, but on minimizing epistemic uncertainty in intermediate summaries, significantly improving long-horizon agent performance.

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

The paper proposes Meta-Team, an experience-driven framework that enables multi-agent systems (MAS) to collaboratively self-evolve by transforming complex execution experiences into reusable improvements for agent behaviors and coordination.

FAM-Bench: A Multimodal Benchmark for Condition-Aware Food-as-Medicine Reasoning

The paper introduces FAM-Bench, a novel multimodal benchmark designed to test advanced, condition-aware reasoning for food-as-medicine applications.

GSAM: A Generalizable and Safe Robotic Framework for Articulated Object Manipulation

GSAM introduces a generalizable and safe robotic framework for articulated object manipulation, significantly improving success rates and reducing variability across diverse tasks by integrating commonsense reasoning and explicit collision constraints.

CAREAgent: Clinical Agent with Structured Reasoning and Tool-Integrated for Order Generation

CAREAgent is a novel agent designed for fine-grained clinical order generation, achieving significant performance improvements on unseen benchmarks by integrating structured reasoning and tool usage.

MViewRouter: Internalizing Geometric Equivariance via Multi-view Alternating Attention for Combinatorial Routing

MViewRouter proposes a multi-view framework that internalizes geometric equivariance using a Multi-view Alternating Attention mechanism to improve generalization and stabilize training for combinatorial routing problems like TSP and CVRP.

Test-Time Training for Zero-Resource Dense Retrieval Reranking

The paper proposes DART, a test-time adaptation method that enhances zero-resource dense retrieval reranking by adaptively tuning a bilinear scoring matrix using pseudo-positive and pseudo-negative examples, achieving significant performance gains with minimal latency.

Large Language Models in Transportation Systems Management and Operations: From Text Reasoning to Multi-modal Decision Support

This survey reviews how Large and Multi-modal Language Models (LLMs/MM-LLMs) are being applied to integrate diverse data sources for enhanced decision support in transportation systems management and operations.

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

The paper introduces Moment-Video, a new benchmark that diagnoses the ability of video MLLMs to understand brief, critical visual events, revealing that current models struggle significantly with temporal fidelity.

Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?

This paper investigates whether adults' struggles with conjunctive causal rules persist when they have agency through active exploration.

Highlighted terms show continued research focus across papers

Papers

cs.CLEmpiricalRecentJun 4, 2026

Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?

Mandana Samiei, Eunice Yiu, Anthony GX-Chen, Dongyan Lin +4 more

This paper investigates whether adults' struggles with conjunctive causal rules persist when they have agency through active exploration.

View →
cs.CVcs.AIRecentJun 1, 2026

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

Xiaolin Liu, Yilun Zhu, Xiangyu Zhao, Xuehui Wang +8 more

The paper introduces Moment-Video, a new benchmark that diagnoses the ability of video MLLMs to understand brief, critical visual events, revealing that current models struggle significantly with temp…

View →
cs.AIRecentMay 31, 2026

CAREAgent: Clinical Agent with Structured Reasoning and Tool-Integrated for Order Generation

Ruihui Hou, Ziyue Huai, Chennuo Zhang, Ziyan Liu +4 more

CAREAgent is a novel agent designed for fine-grained clinical order generation, achieving significant performance improvements on unseen benchmarks by integrating structured reasoning and tool usage.

View →
cs.LGcs.AIRecentMay 31, 2026

MViewRouter: Internalizing Geometric Equivariance via Multi-view Alternating Attention for Combinatorial Routing

Shiyan Liu, Bohan Tan, Yaoxin Wu, Yan Jin

MViewRouter proposes a multi-view framework that internalizes geometric equivariance using a Multi-view Alternating Attention mechanism to improve generalization and stabilize training for combinatori…

View →
cs.IRcs.AIcs.LGRecentMay 31, 2026

Test-Time Training for Zero-Resource Dense Retrieval Reranking

Shiyan Liu, Yichen Li

The paper proposes DART, a test-time adaptation method that enhances zero-resource dense retrieval reranking by adaptively tuning a bilinear scoring matrix using pseudo-positive and pseudo-negative ex…

View →
cs.AIRecentMay 31, 2026

Large Language Models in Transportation Systems Management and Operations: From Text Reasoning to Multi-modal Decision Support

Siyan Li, Zehao Wang, Jiachen Li, Kanok Boriboonsomsin +2 more

This survey reviews how Large and Multi-modal Language Models (LLMs/MM-LLMs) are being applied to integrate diverse data sources for enhanced decision support in transportation systems management and…

View →
cs.AIRecentMay 29, 2026

FAM-Bench: A Multimodal Benchmark for Condition-Aware Food-as-Medicine Reasoning

Mingyang Mao, Bhargav Rishi Medisetti, Utkarsh Grover, Tanvir Ibrahim +3 more

The paper introduces FAM-Bench, a novel multimodal benchmark designed to test advanced, condition-aware reasoning for food-as-medicine applications.

View →
cs.ROcs.AIRecentMay 29, 2026

GSAM: A Generalizable and Safe Robotic Framework for Articulated Object Manipulation

Beichen Shao, Mengying Xie, Heng Su, Wanyi Zhang +4 more

GSAM introduces a generalizable and safe robotic framework for articulated object manipulation, significantly improving success rates and reducing variability across diverse tasks by integrating commo…

View →
cs.AIRecentMay 28, 2026

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

Ziyan Liu, Zhezheng Hao, Yeqiu Chen, Hong Wang +6 more

The paper introduces Metacognitive Memory Policy Optimization (MMPO), a novel memory training approach that optimizes LLM memory not based on final task success, but on minimizing epistemic uncertaint…

View →
cs.MAcs.AIRecentMay 28, 2026

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

Zhezheng Hao, Tianfu Wang, Huanshuo Dong, Ziyan Liu +6 more

The paper proposes Meta-Team, an experience-driven framework that enables multi-agent systems (MAS) to collaboratively self-evolve by transforming complex execution experiences into reusable improveme…

View →
cs.CRcs.AIRecentMay 23, 2026

CyBOKClaw: Human-in-the-Loop CyBOK Mapping for Cybersecurity Curriculum

Yan Lin Aung, Kevin Togbe

CyBOKClaw is an interpretable human-in-the-loop retrieval framework designed to map broad cybersecurity keywords to the Cyber Security Body of Knowledge (CyBOK), achieving high expert-guided mapping a…

View →
cs.CRRecentMay 21, 2026

BYOT-CPS: A Hybrid Cyber-Physical Systems Testbed for IoT Security Assessment and Platform Evaluation

Yan Lin Aung, Nelson Che Neba

The paper introduces BYOT-CPS, a hybrid cyber-physical testbed that bridges the gap between purely simulated and purely physical IoT testing environments, enabling realistic and scalable security asse…

View →
cs.CRRecentMay 15, 2026

LymphNode: A Plug-and-Play Access Control Method for Deep Neural Networks

Hanyu Pei, Shang Liu, Zeyan Liu

LymphNode is a novel, post-hoc access control framework that protects Deep Neural Networks (DNNs) from model extraction and inversion attacks by enforcing a default-deny policy and selectively restori…

View →
cs.CRRecentMay 13, 2026

From Compression to Accountability: Harmless Copyright Protection for Dataset Distillation

Yan Liang, Ziyuan Yang, Mengyu Sun, Joey Tianyi Zhou +1 more

The paper proposes SubPopMark, a novel subpopulation-driven framework that injects harmless, verifiable markers into distilled datasets to prevent copyright infringement and data leakage.

View →
cs.CVcs.AIcs.CRRecentMay 9, 2026

FraudBench: A Multimodal Benchmark for Detecting AI-Generated Fraudulent Refund Evidence

Xinyu Yan, Boyang Chen, Jiaming Zhang, Tiantong Wu +11 more

The paper introduces FraudBench, a multimodal benchmark designed to detect AI-generated fraudulent refund evidence, finding that current AI models struggle significantly with claim-conditioned fake-da…

View →
cs.CRcs.AIcs.CLRecentMay 2, 2026

Tracing the Dynamics of Refusal: Exploiting Latent Refusal Trajectories for Robust Jailbreak Detection

Xulin Hu, Che Wang, Wei Yang Bryan Lim, Jianbo Gao +1 more

The paper proposes SALO, a novel detector that monitors the dynamic, layer-wise activation pattern (Refusal Trajectory) to improve jailbreak detection robustness compared to traditional methods relyin…

View →
cs.CRcs.AIRecentMay 1, 2026

E-MIA: Exam-Style Black-Box Membership Inference Attacks against RAG Systems

Zelin Guan, Shengda Zhuo, Zeyan Li, Jinchun He +3 more

E-MIA introduces a novel, stealthy black-box membership inference attack that converts verifiable hard evidence within a candidate document into an objective, multi-part exam score to determine if the…

View →
cs.CRcs.AIcs.CLRecentApr 6, 2026

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Zijun Wang, Haoqin Tu, Letian Zhang, Hardy Chen +10 more

This paper conducts the first real-world safety evaluation of the personal AI agent OpenClaw, demonstrating that its broad system access creates inherent vulnerabilities that significantly increase th…

View →
cs.CRcs.CYcs.SIRecentMar 22, 2026

Estimating the Social Cost of Corporate Data Breaches

Lina Alkarmi, Armin Sarabi, Mingyan Liu

This study estimates the true social cost of corporate data breaches by quantifying the direct financial and opportunity costs to victims, finding that these costs can significantly exceed corporate s…

View →