Yan Li

27 indexed papers

Top categories

AI ×17, Crypto ×11, NLP ×5, ML ×4, Vision ×3, Robotics ×2, Info Retrieval ×2, Sound ×1

Frequent co-authors

Ziyan Liu 3×

Xukun Luan 2×

Gang Zhang 2×

Jinyan Liu 2×

Shaofeng Zhang 2×

Zhihang Zhong 2×

Research Timeline

2026

BYOT-CPS: A Hybrid Cyber-Physical Systems Testbed for IoT Security Assessment and Platform Evaluation

The paper introduces BYOT-CPS, a hybrid cyber-physical testbed that bridges the gap between purely simulated and purely physical IoT testing environments, enabling realistic and scalable security assessment.

CyBOKClaw: Human-in-the-Loop CyBOK Mapping for Cybersecurity Curriculum

CyBOKClaw is an interpretable human-in-the-loop retrieval framework designed to map broad cybersecurity keywords to the Cyber Security Body of Knowledge (CyBOK), achieving high expert-guided mapping accuracy on both development and validation datasets.

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

The paper introduces Metacognitive Memory Policy Optimization (MMPO), a memory training approach that optimizes an LLM agent's memory by minimizing epistemic uncertainty in intermediate summaries rather than by rewarding final task success, significantly improving long-horizon agent performance.

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

The paper proposes Meta-Team, an experience-driven framework that enables multi-agent systems (MAS) to collaboratively self-evolve by transforming complex execution experiences into reusable improvements for agent behaviors and coordination.

FAM-Bench: A Multimodal Benchmark for Condition-Aware Food-as-Medicine Reasoning

The paper introduces FAM-Bench, a novel multimodal benchmark designed to test advanced, condition-aware reasoning for food-as-medicine applications.

GSAM: A Generalizable and Safe Robotic Framework for Articulated Object Manipulation

GSAM introduces a generalizable and safe robotic framework for articulated object manipulation, significantly improving success rates and reducing variability across diverse tasks by integrating commonsense reasoning and explicit collision constraints.

CAREAgent: Clinical Agent with Structured Reasoning and Tool-Integrated for Order Generation

CAREAgent is a novel agent designed for fine-grained clinical order generation, achieving significant performance improvements on unseen benchmarks by integrating structured reasoning and tool usage.

MViewRouter: Internalizing Geometric Equivariance via Multi-view Alternating Attention for Combinatorial Routing

MViewRouter proposes a multi-view framework that internalizes geometric equivariance using a Multi-view Alternating Attention mechanism to improve generalization and stabilize training for combinatorial routing problems like TSP and CVRP.

Test-Time Training for Zero-Resource Dense Retrieval Reranking

The paper proposes DART, a test-time adaptation method that enhances zero-resource dense retrieval reranking by adaptively tuning a bilinear scoring matrix using pseudo-positive and pseudo-negative examples, achieving significant performance gains with minimal latency.

Large Language Models in Transportation Systems Management and Operations: From Text Reasoning to Multi-modal Decision Support

This survey reviews how Large and Multi-modal Language Models (LLMs/MM-LLMs) are being applied to integrate diverse data sources for enhanced decision support in transportation systems management and operations.

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

The paper introduces Moment-Video, a new benchmark that diagnoses the ability of video MLLMs to understand brief, critical visual events, revealing that current models struggle significantly with temporal fidelity.

Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?

This paper investigates whether adults' struggles with conjunctive causal rules persist when they have agency through active exploration.

ARTSN: Exact and Adaptive Self-triggered Traffic Scheduling for ARTS Networks

This paper proposes ARTSN, a scheduling paradigm for autonomous real-time systems using time-sensitive networking, addressing volatility and absence challenges of self-triggered traffic.

ShopX: A Foundation Model for Intent-to-Item Fulfillment in Agentic Shopping

This paper proposes ShopX, a model-centric framework for intent-driven shopping experiences using a single foundation model for intent understanding, execution planning, and item-space operations.

Clinical Translation of Brain-Computer Interface in China: A Landscape Analysis of Investigator-Initiated Trials, Registered Clinical Trials, and Regulatory Approval

This paper presents the first quantitative analysis of China's Brain-Computer Interface (BCI) translational ecosystem, examining clinical trials, investigator-initiated trials, and regulatory-approved products.

Ideas Have Genomes: Benchmarking Scientific Lineage Reasoning and Lineage-Grounded Idea Generation

The paper introduces IdeaGene-Bench, a benchmark for scientific lineage reasoning and idea generation, which includes 1,961 golden lineage traces, 1,085 curated Idea Genome objects, and 920 pairwise GenomeDiff records.

Is External Database Protection Static in Retrieval-Augmented Generation? Rethinking Privacy Preservation under Dynamic Queries

This paper proposes PA-HDP, a framework for privacy-preserving retrieval-augmented generation using prompt-aware dynamic hierarchical differential privacy.

Code-Poisoning Property Inference Attacks

This paper introduces the Code-Poisoning Property Inference Attack (CPPIA), the first code-level Property Inference Attack (PIA) for leaking private information from ML models trained on private data, overcoming limitations of existing work.

StellarTTS: Sparse Temporal Embedding for Low-Latency and Robust Speech Synthesis

This paper introduces StellarTTS, a mobile-optimized non-autoregressive text-to-speech framework with sparse temporal embeddings and a semantic-aware codec, achieving lower latency and stronger robustness.

GS-Agent: Creating 4D Physical Worlds With Generative Simulation

This paper introduces GS-Agent, an end-to-end multi-agent framework that generates realistic, dynamic, and controllable 4D physical worlds from natural language descriptions by emulating the human creation process with physics engines.

Papers

cs.RO · cs.AI · cs.CL · Empirical · Recent · Jul 23, 2026

GS-Agent: Creating 4D Physical Worlds With Generative Simulation

Hongxin Zhang, Chunru Lin, Junyan Li, Zhou Xian +2 more

cs.SD · Empirical