Lei Xu

8 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×6Vision×2ML×2NLP×1Distributed×1Performance×1Software Eng.×1HCI×1

Frequent co-authors

Haolei Xu2×

Haiwen Hong2×

Hongxing Li2×

Weiming Lu2×

Yongliang Shen2×

Kuan Li2×

Research Timeline

2026

CardioLens: Revealing the Clinical Reality Gap of MLLMs via Multi-Sequence Cardiac MRI Evaluations

The paper introduces CardioLens, a rigorous evaluation testbed for multi-sequence Cardiac MRI, which reveals that current Multimodal Large Language Models (MLLMs) exhibit a significant 'clinical reality gap' and perform poorly when simulating real-world cardiac interpretation workflows.

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

This study analyzes over 20,000 real-world coding sessions to show that AI coding agents frequently fail users through subtle misalignment, requiring constant manual correction even when major system damage is avoided.

HomeFlow: A Data Flywheel for Smart Home Agent Training with Verifiable Simulation

The paper introduces HomeFlow, a verifiable data flywheel that procedurally generates high-quality, multi-turn training data for smart home agents, achieving state-of-the-art performance on smart home tasks.

Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling

The paper introduces FTDiff, a reinforcement learning fine-tuning framework that efficiently generates high-quality, drug-like molecules constrained by a target protein structure, outperforming existing methods.

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes

The paper introduces SMH-Bench, a comprehensive benchmark built on a simulator to rigorously test LLM agents' ability to perform complex, environment-grounded reasoning and actions in realistic smart-home scenarios.

Perceive-to-Reason: Decoupling Perception and Reasoning for Fine-Grained Visual Reasoning

This paper proposes Perceive-to-Reason (P2R), a framework for fine-grained visual reasoning that decouples perception from reasoning and introduces a new reinforcement learning strategy.

TileSight: A First-Principles Tile-Centric Analytical GPU Performance Model from Cores to Clusters

TileSight is a tile-centric performance-modeling tool that predicts single-GPU kernel latency and cache hit rates with low error, outperforming state-of-the-art baselines and transferring well across architectures.

Pass the Baton: Trajectory-Relayed On-Policy Distillation

The paper introduces Relay On-Policy Distillation (Relay-OPD), a method for on-policy distillation that constructs relay trajectories to address prefix failure and improve performance.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AINEWEmpiricalJul 28, 2026

Pass the Baton: Trajectory-Relayed On-Policy Distillation

Haolei Xu, Xiaowen Xu, Haiwen Hong, Zixuan Ni +4 more

The paper introduces Relay On-Policy Distillation (Relay-OPD), a method for on-policy distillation that constructs relay trajectories to address prefix failure and improve performance.

View →

cs.DCcs.PF