Peng Li

24 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×14Crypto×9Software Eng.×3Vision×2ML×2Robotics×2Sound×1Architecture×1

Frequent co-authors

Peng Liu7×

Wei Zhou2×

Jinpeng Liu2×

Yunpeng Li2×

Xiang Wang2×

Xiaopeng Li2×

Research Timeline

2026

More Than Meets the Eye: A Semantics-Aware Traffic Augmentation Framework for Generalizable Website Fingerprinting

The paper proposes SATA, a semantics-aware traffic augmentation framework, to significantly improve the generalization of website fingerprinting models by addressing variability in resource composition and cross-layer feature instability.

Stop Starving or Stuffing Me: Boosting Firmware Fuzzing Efficiency with On-demand Input Delivery

The paper introduces FIDO, a novel framework that significantly boosts firmware fuzzing efficiency by accurately managing the timing and quantity of input delivery based on the firmware's internal input availability checks.

Not What You Asked For: Typographic Attacks in Household Robot Manipulation

This paper demonstrates that typographic attacks pose a significant, measurable, and physically consequential threat to household robot manipulation systems by causing the robot to grasp and transport the wrong objects.

HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs

The paper introduces HRBench, a unified and comprehensive evaluation framework for systematically benchmarking and comparing various thinking-mode switching strategies in hybrid-reasoning LLMs.

PetroBench: A Benchmark for Large Language Models in Petroleum Engineering

The paper introduces PetroBench, a comprehensive benchmark for evaluating Large Language Models across various domains of petroleum engineering, finding that models perform better on subjective tasks than on objective factual knowledge.

V2I Work Zone Geometry Reconstruction with Pose-Conditioned UWB Range Denoising

The paper proposes a pose-conditioned, permutation-equivariant denoiser to accurately reconstruct work zone geometry using noisy Ultra-Wideband (UWB) range data from connected and autonomous vehicles (CAVs).

Strengthening Polymorphic Prompt Assembling: Dynamic Separator Generation Against Emerging Prompt Injection Attacks

The paper introduces dynamic, per-request separator generation for Polymorphic Prompt Assembling (PPA), significantly reducing the blast-radius vulnerability to prompt injection attacks by ensuring unique separators for every request.

Generating Graph-like Rules for Knowledge Graph Reasoning via Diffusion Models

The paper proposes GRiD, a novel framework that uses a two-phase training strategy (supervised pre-training and RL fine-tuning) to discover complex, graph-like rules for knowledge graph reasoning, overcoming limitations of existing methods.

Bridging Requirements and Architecture: Multi-Agent Orchestration with External Knowledge and Hierarchical Memory

The paper introduces MAAD, a multi-agent framework that autonomously transforms software requirements into comprehensive, multi-view architectural blueprints, significantly improving completeness and reducing manual validation.

Efficient Exploration for Iterative Nash Preference Optimization

The paper proposes a novel, explicitly exploratory iterative Nash Learning from Human Feedback (NLHF) algorithm that achieves strong regret bounds for optimizing LLMs based on complex, non-scalar human preferences.

Not All Errors Are Equal: A Systematic Study of Error Propagation in Large Language Model Inference

This paper systematically studies how soft errors propagate during Large Language Model (LLM) inference using a novel fault-injection framework, providing critical insights and mitigation strategies for improving LLM reliability.

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

The paper argues that observed gains in multimodal agents using tools may be due to learning tool-calling patterns rather than genuine capability expansion, finding that tool access provides little consistent aggregate improvement.

TROPHIES: Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos

TROPHIES introduces a unified framework to jointly reconstruct dynamic humans, static scenes, and camera poses from multi-view videos, achieving globally consistent and physically plausible 4D reconstructions.

What to Format and How: A Benchmark and Workflow Approach for Document Formatting

The paper introduces DocFormBench, a new benchmark for content-aware document formatting, and proposes DocFormFlow, a workflow that improves formatting accuracy and efficiency by decoupling target localization from modification execution.

Protecting K-Nearest Neighbor Queries from Location Inference Attacks

This paper identifies two novel location inference attacks against k-nearest neighbor queries (kNNQ) and proposes DPRS, a differential privacy framework that effectively protects location privacy while maintaining high query utility.

TAHOE: Text-to-SQL with Automated Hint Optimization from Experience

The paper presents Tahoe, a system that optimizes Text-to-SQL performance through dynamic data management and hint learning.

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

AutoPass is a multi-agent framework for compiler performance tuning using a large language model, enabling it to query compiler-internal optimization states and analyze the intermediate representation for latency-improving edits.

Agent-Native Immune System: Architecture, Taxonomy, and Engineering

This paper introduces the Agent-Native Immune System (ANIS), an endogenous defense architecture for autonomous agents against runtime hijacking.

In-situ Indexing via Memristive Content-Addressable Memory

The paper introduces PATH, an in-situ indexing architecture for Processing-in-Memory systems that achieves higher throughput, lower tail latency, and fewer memory accesses than state-of-the-art schemes.

PS4: Proxy-Supervised Joint Training for Real Target Speaker Extraction

The paper introduces PS4, a framework for training target speaker extraction models using a large-scale corpus and proxy-supervised joint training strategy.

Highlighted terms show continued research focus across papers

Papers

cs.SDcs.AIEmpiricalRecentJul 9, 2026

PS4: Proxy-Supervised Joint Training for Real Target Speaker Extraction

Wanyi Ning, Wei Zhou, Yingpeng Li, Yinshang Guo +2 more

The paper introduces PS4, a framework for training target speaker extraction models using a large-scale corpus and proxy-supervised joint training strategy.

View →

cs.ARcs.ETEmpirical