Xin Li

37 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×25Crypto×13NLP×11Vision×6ML×5Software Eng.×3Distributed×2Info Retrieval×2

Frequent co-authors

Yuexin Li3×

Yulin Chen3×

Yufei He3×

Tri Cao3×

Bryan Hooi3×

Jie Zhang2×

Research Timeline

2026

Dive into Waves: Morlet Spectral Transformer for Cross-Subject Emotion Decoding from EEG

The paper proposes the Morlet Spectral Transformer (MST), a novel architecture that effectively decodes cross-subject emotion from EEG by designing specialized spectral and spatial representations, outperforming existing large foundation models.

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

The paper introduces Moment-Video, a new benchmark that diagnoses the ability of video MLLMs to understand brief, critical visual events, revealing that current models struggle significantly with temporal fidelity.

InfoMerge: Information-aware Token Compression for Efficient Video Large Language Models

InfoMerge is a novel, training-free method that significantly compresses visual tokens for Video-LLMs by estimating temporal redundancy and allocating tokens based on content richness, achieving high efficiency with minimal performance loss.

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

The paper introduces SPADE-Bench, a new benchmark designed to rigorously evaluate 'agent deception'—the divergence between an agent's reported plan and its actual executed actions—which is a critical safety issue for autonomous LLM agents.

CAPF: Guiding Search-Agent Rollouts with Credit-Attenuated Privileged Feedback

The paper proposes Credit-Attenuated Privileged Feedback (CAPF), a training-time mechanism that uses verifier-side information to guide LLM search agents, significantly improving their performance on complex QA tasks.

Revisiting Ripple Effects in Knowledge Editing through Pressure-Aware Joint Neighborhood Optimization

The paper proposes Joint Neighborhood Optimization (JNO), a novel knowledge-editing framework that jointly addresses the coupled pressures of desirable knowledge propagation and unintended knowledge leakage during single-edit updates in LLMs.

Joint Agent Memory and Exploration Learning via Novelty Signals

The JAMEL framework addresses the challenge of effective exploration in open-ended environments by jointly training agent memory and exploration policies using natural, novelty-driven signals.

Privacy-preserving Information Sharing in Oligopoly Competitions

The paper analyzes information-sharing mechanisms in oligopolies, finding that privacy protection alone is insufficient to incentivize suppliers to share data; successful sharing requires combining privacy safeguards with a sufficiently informative external signal.

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards

QUBRIC introduces a co-design framework that simultaneously optimizes queries and rubrics, overcoming the bottleneck of vague rubrics derived from open-ended questions, leading to significant gains in RL performance.

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

MLEvolve is a novel self-evolving multi-agent framework that enables LLM agents to discover and optimize machine learning algorithms for complex, long-horizon tasks.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

CORE-Bench: A Comprehensive Benchmark for Code Retrieval in the Era of Agentic Coding

This paper introduces CORE-Bench, a comprehensive benchmark for code retrieval in agentic coding.

COSM: A Cooperative Scheduling Framework for Concurrent PIM and CPU Execution on Mobile Devices

The paper introduces COSM, a cooperative scheduling framework to facilitate concurrent operation of Processing-in-Memory (PIM) and CPU tasks on mobile platforms, improving PIM throughput by up to 2.8x with less than 2.0% CPU performance loss.

Hierarchical Acoustic-Semantic Modeling: Modality Separation and Semantic Coherence for Full-Duplex SLMs

This paper identifies the root cause of performance degradation in full-duplex Spoken Language Models (SLMs) due to modality interference and proposes Lychee-FD, a framework that decouples conflicting modalities in deep layers while preserving cross-modality coherence.

From RGB Generation to Dense Field Readout: Pixel-Space Dense Prediction with Text-to-Image Models

This paper proposes ReChannel, a method for dense prediction using a pretrained DiT model, which keeps the encoder but removes the decoder and adapts it with task LoRA. ReChannel maps each token to its corresponding pixel-space patch through a shared linear head.

MonoIR-RS: Infrared Remote Sensing Vision-Language Learning with CLIP and VLM Adaptation

This paper introduces MonoIR-RS, a large-scale infrared remote-sensing vision-language dataset and benchmark for understanding infrared imagery.

Deep Interaction: An Efficient Human-AI Interaction Method for Large Reasoning Models

This paper proposes an efficient human intervention mechanism, Deep Interaction, for correcting reasoning errors in large language models, achieving over 25% improvement in correction success rate and reducing token usage by approximately 40%.

Scalable LLM Agent Tool Access in the Cloud

A cloud-scale gateway system for MCP services is presented, which breaks the direct-connect model and offloads legacy service integration, consolidates incompatible MCP variants, and reduces tool selection time and token usage.

End-to-End Markov State Sequence Learning for Auditory Attention Decoding

This paper proposes an end-to-end Markov framework for auditory attention decoding using conditional random fields and an EEG--speech correlation backbone.

Where Is the Cost of Third-Party API Routers in Agentic Software Development?

This paper conducts an empirical study on the effects of router-side injection in coding agents and evaluates the effectiveness of existing client-side safeguards.

Highlighted terms show continued research focus across papers

Papers

cs.SEcs.AIcs.CLEmpiricalRecentJul 26, 2026

Where Is the Cost of Third-Party API Routers in Agentic Software Development?

Donghao Fu, Jingxin Li, Xue Jiang, Yihong Dong

This paper conducts an empirical study on the effects of router-side injection in coding agents and evaluates the effectiveness of existing client-side safeguards.

View →

cs.SDcs.HCEmpirical