Xiang Li

27 indexed papers

Top categories

AI ×17, Crypto ×6, ML ×5, NLP ×4, Vision ×3, Robotics ×2, Audio and Speech Processing ×2, Software Eng. ×2

Frequent co-authors

Zhaoxiang Liu 3×

Di Wu 2×

Shiguo Lian 2×

Xiang Liu 2×

Xiao Zhang 1×

Jiaxuan Li 1×

Research Timeline

2026

Plant, Persist, Trigger: Sleeper Attack on Large Language Model Agents

This paper introduces the concept of 'Sleeper Attack,' demonstrating that adversarial content can persist across multiple interactions with an LLM agent, posing a more subtle and difficult-to-detect safety threat than single-interaction attacks.

Reasoning Matters: Mitigate Hallucination in Multimodal Large Reasoning Models via Reasoning-Conditioned Preference Optimization

The paper proposes Reasoning-Conditioned Direct Preference Optimization (RC-DPO) to effectively mitigate hallucinations in multimodal large reasoning models by explicitly conditioning the preference optimization on the Chain-of-Thought (CoT) process.

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

The paper introduces Crafter, a multi-agent harness that significantly improves the generation of editable, publication-quality scientific figures from diverse inputs, addressing the limitations of existing single-purpose systems.

Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents

The paper introduces Agora, a domain-aware multi-agent framework that successfully detects deep, previously unknown logic bugs in complex consensus protocols, outperforming existing LLM-based analysis methods.

ESPO: Early-Stopping Proximal Policy Optimization

ESPO is a novel reinforcement learning algorithm that detects trajectory failure in large language models and terminates rollouts early, significantly improving performance on mathematical reasoning benchmarks while reducing computational cost.

BAGEN: Are LLM Agents Budget-Aware?

This paper introduces the concept of Budget-Aware Agents (BAGEN), shows that current LLM agents often fail to manage resources proactively, and argues that incorporating early stopping and interval estimation significantly improves efficiency.

Learning Cardiac Latent Representations in Vectorcardiogram Space

This paper introduces LVCG, a novel self-supervised framework that learns unified, view-invariant latent representations of cardiac electrical activity directly in the physically grounded Vectorcardiogram (VCG) space, improving generalization over traditional ECG-space methods.

Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents

The paper introduces MASA, a model-aware skill alignment framework that adaptively rewrites general and task-specific skills for LLM agents, achieving superior performance across diverse backbones and environments.

Initialization is Half the Battle: Generating Diverse Images from a Guidance Potential Posterior

The paper introduces Diversity-inducing Initialization (DivIn), a novel method that improves image diversity by re-weighting the initial noise selection based on the guidance potential, thereby mitigating mode collapse.

eMoT: evolving Memory-of-Thought via Symbolic Anchoring and Memory Corrosion

The eMoT framework enhances multi-step reasoning in LLMs by treating reasoning as an evolving memory, stabilizing performance through symbolic computation and structured refinement.

Community-Aware Assessment of Social Textual Engagement and Resonance: A Human-Centric Perspective on User-Generated Content Evaluation

The paper introduces CASTER, a new human-centric task for evaluating User-Generated Content (UGC) resonance, and proposes MEDEA, an architecture that uses a Social Chain-of-Thought mechanism to simulate community reactions for quality assessment.

Federated Learning for Multi-Center Sepsis Early Prediction with Privacy-Preserving

This study successfully demonstrates that federated learning can achieve prediction accuracy comparable to centralized modeling for multi-center sepsis prediction while fundamentally preserving patient data privacy.

Token-Operations-Oriented Inference Optimization Techniques for Large Models

This paper proposes a four-layer technical architecture for large model inference optimization, including Multi-model Fusion, Model Optimization, Compute-Model Fusion, and Compute-Network-Model Fusion.

Adversarial Contamination Meets Hard Thresholding: An Iterative Algorithm with Signal Adaptivity and Minimax Optimality

This paper proposes a two-stage algorithm, AC-IHT, for high-dimensional regression with contamination, achieving near-optimal estimation and the strong oracle property.

KernelFlume: Elastic Core-Attention Scaling for Agentic Long-Context Decoding

KernelFlume is a decode-centric architecture that disaggregates the stable projection/FFN path from core-attention computation to improve efficiency and reduce cost in serving long-context demand.

Chronos: A Physics-Informed Full-History Framework for Non-Markovian Long-Horizon Manipulation

This paper introduces Chronos, a physics-informed framework for non-Markovian long-horizon manipulation, which elevates observation history to the latent state of the policy dynamics and achieves higher success rates with fewer parameters than Markovian VLA baselines in both simulated and real-world experiments.

Experience Graphs: The Data Foundation for Self-Improving Agents

This paper proposes Trellis, a data foundation that treats experience graphs from long-horizon agentic tasks as first-class, governed, queryable database state.

CRISP: Constrained Refinement via Iterative Squeezing Process for Robust Medical Image Segmentation under Domain Shift

This paper proposes CRISP, a model-agnostic framework for source-only medical image segmentation under distribution shift, which uses rank stability of positive regions to derive robust spatial priors.

SALMONN-2: Advancing General-Purpose Hearing Abilities with Self-Supervised Representations

The paper proposes SALMONN-2, an ALLM built on a unified SSL encoder, and presents a multi-layer feature fusion adapter to better exploit hierarchical SSL encoder representations. It also explores multimodal in-context learning in ALLMs and shows that a general-purpose SSL encoder achieves comparable performance to specialized audio encoders.

RL-MACRO: A Cybernetic Closed-Loop Intelligence Framework for Multimodal Adaptive Robotic Craniotomy

This paper proposes RL-MACRO, a cybernetic closed-loop intelligence framework for autonomous robotic craniotomy, which includes a CNN-LSTM observer for temperature reconstruction, an offline Implicit Q-Learning policy, and a novel dual-head Actor for coordinating cutting parameters.

Papers

cs.RO · Empirical · Recent · Jul 23, 2026

RL-MACRO: A Cybernetic Closed-Loop Intelligence Framework for Multimodal Adaptive Robotic Craniotomy

Xiao Zhang, Jiaxuan Li, Renzhen Le, Di Wu +8 more

eess.AS · Empirical · Recent