Jie Li

22 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×15Crypto×11NLP×7ML×3Robotics×2Vision×2Networking×2Software Eng.×2

Frequent co-authors

Yan Wang2×

Zihan Wang2×

Weijie Liu2×

Zongjie Li2×

Shuai Wang2×

OneRec Team1×

Research Timeline

2026

PEB Separation and State Migration: Unmasking the New Frontiers of DeFi AML Evasion

The paper demonstrates that current transfer-based AML systems fail in complex DeFi environments because economic value migration can be structurally decoupled from explicit token transfers.

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode

The paper independently stress-tests Claude Code's auto mode permission system using a deliberately ambiguous benchmark, finding that its true false negative rate is significantly higher than reported, particularly due to unmonitored file edits.

Geometry-Aware Localized Watermarking for Copyright Protection in Embedding-as-a-Service

The paper proposes GeoMark, a geometry-aware localized watermarking framework that robustly protects Embedding-as-a-Service (EaaS) against model stealing and copyright infringement while preserving utility.

Behavioral Consistency and Transparency Analysis on Large Language Model API Gateways

The paper introduces GateScope, a black-box framework that audits commercial LLM API gateways, revealing frequent discrepancies in model behavior, billing, and performance across real-world services.

XekRung Technical Report

The paper introduces XekRung, a frontier large language model for cybersecurity, which achieves state-of-the-art performance on domain-specific benchmarks through a comprehensive training and evaluation pipeline.

Misrouter: Exploiting Routing Mechanisms for Input-Only Attacks on Mixture-of-Experts LLMs

Misrouter introduces an input-only adversarial framework to exploit the routing mechanisms of Mixture-of-Experts (MoE) LLMs, enabling unsafe behavior induction against remotely hosted, black-box services.

Analyzing Codes of Conduct for Online Safety in Video Games at Scale

The paper analyzes Codes of Conduct (CoCs) for online video games using a novel pipeline, finding that most multiplayer games lack CoCs despite safety needs, and that CoCs often lack specificity regarding interpersonal and underage safety harms.

OEP: Poisoning Self-Evolving LLM Agents via Locally Correct but Non-Transferable Experiences

The paper introduces Obsessive Experience Poisoning (OEP), a low-privilege black-box attack that poisons self-evolving LLM agents by generating locally correct but harmful experiences, causing dangerous over-generalization during reflection.

OpenURMA: A Clean-Room Open Implementation of the Unified Bus Protocol

OpenURMA provides the first open, clean-room implementation of Huawei's Unified Bus (UB) protocol, demonstrating a significant reduction in latency and increase in throughput for remote memory access compared to existing RDMA standards like RoCEv2.

ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation

ADWIN introduces an adaptive window framework for on-policy distillation (OPD) that efficiently manages the supervision horizon by training on short, teacher-anchored prefixes while using delayed full-rollout probes to maintain alignment, significantly reducing training cost while preserving accuracy.

Planning with the Views via Scene Self-Exploration

The paper addresses the challenge of multi-turn view planning for VLMs by proposing an iterative framework that uses self-exploration and view graph distillation, significantly improving planning performance over state-of-the-art models.

GaMi: Geometry-Agnostic Material Identification via Cross-Modal Subtractive Disentanglement

GaMi is a multimodal material identification system that uses mmWave and acoustic sensing with a cross-modal subtractive disentanglement framework to achieve high accuracy (95.2%) for material identification regardless of geometric variations.

WaveFilter: Enhancing the Long-Context Capability of Diffusion LLMs via Wavelet-Guided KV Cache Filtering

WaveFilter is a novel, training-free framework that uses wavelet transforms to efficiently filter critical tokens in the KV cache, significantly improving the long-context performance of Diffusion LLMs.

Connecting the Dots: Benchmarking Reflective Memory in Long-Horizon Dialogue

The paper introduces RefMem-Bench, a new benchmark for measuring reflective memory in long-horizon dialogue, and proposes REMIND, a framework that significantly improves models' ability to synthesize fragmented cues into high-level interpretations.

LongAttnComp: Cross-Family Context Compression for Long-Context Reasoning

LongAttnComp introduces a novel, two-stage fine-tuning framework for context compression that significantly improves long-context reasoning performance, matching or exceeding full-context accuracy on demanding tasks like code debugging.

COMAP: Co-Evolving World Models and Agent Policies for LLM Agents

COMAP introduces a novel co-evolutionary framework that simultaneously updates textual world models and agent policies through closed-loop interaction, significantly improving long-horizon decision-making for LLM agents.

SentGuard: Sentence-Level Streaming Guardrails for Large Language Models

SentGuard introduces a novel sentence-level streaming guardrail that operates in parallel with LLM generation, achieving high detection rates of unsafe content early in the response while maintaining low false-positive rates.

Federated Learning for Multi-Center Sepsis Early Prediction with Privacy-Preserving

This study successfully demonstrates that federated learning can achieve prediction accuracy comparable to centralized modeling for multi-center sepsis prediction while fundamentally preserving patient data privacy.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

RiskFlow: Fast and Faithful Safety-Critical Traffic Scenario Generation

RiskFlow is a novel framework that generates realistic and safety-critical multi-agent traffic scenarios by reformulating trajectory generation as a single-pass transport problem in the action space.

Highlighted terms show continued research focus across papers

Papers

cs.IRcs.AIcs.CLRecentJun 4, 2026

OneReason Technical Report

OneRec Team, Biao Yang, Boyang Ding, Chenglong Chu +80 more

View →

cs.ROcs.AIRecentJun 4, 2026