Peng Wang

14 indexed papers

Top categories

AI ×9, ML ×5, NLP ×5, Crypto ×5, Vision ×3, Audio and Speech Processing ×1, Info Retrieval ×1, physics.comp-ph ×1

Frequent co-authors

Zhipeng Wang (3×)

Yanqiao Zhu (2×)

Wupeng Wang (2×)

Xipeng Qiu (2×)

Kai Yu (2×)

Zhifu Gao (2×)

Research Timeline

2026

LoopTrap: Termination Poisoning Attacks on LLM Agents

The paper introduces LoopTrap, an automated red-teaming framework that demonstrates how malicious prompts can poison the termination judgment of LLM agents, causing unbounded computation.

Five Attacks on x402 Agentic Payment Protocol

This paper analyzes the x402 agentic payment protocol, demonstrating through five concrete, practical attacks that it is vulnerable across multiple stages of its payment workflow.

Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization

This paper analyzes the multi-regime behavior of Scientific Machine Learning (SciML) models, finding that optimization effectiveness is regime-specific and that failure modes require a unified, regime-aware diagnostic approach.

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning

VCap introduces a novel Witness-Adjudicator reward mechanism that provides highly precise, factually grounded feedback for visual captioning, enabling state-of-the-art performance in RL-trained multimodal models.

FinBoardBench: Benchmarking Dynamic Wealth Management and Strategic Financial Reasoning of LLMs via Board Game Simulations

The paper introduces FinBoardBench, a novel evaluation suite using financial board games to demonstrate that current LLMs, despite strong static reasoning, fail at complex, dynamic wealth management and strategic decision-making.

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

The paper introduces AgentDoG 1.5, a lightweight and scalable alignment framework that significantly improves AI agent safety and security for complex, open-world agentic scenarios.

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation

The paper proposes Agentic ASR, a closed-loop framework that treats ASR as a multi-turn refinement task, significantly improving semantic accuracy over traditional token-level metrics.

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

The paper introduces AgentDoG 1.5, a lightweight and scalable alignment framework that significantly improves AI agent safety and security for complex open-world agent deployments.

LoRA-Key: User-Centric LoRA Watermarking for Text-to-Image Diffusion Models

LoRA-Key introduces a user-centric watermarking framework that attaches a recoverable ownership key to LoRA modules via a standalone Watermark LoRA, providing lightweight, plug-and-play copyright protection without requiring per-LoRA retraining.

eMoT: evolving Memory-of-Thought via Symbolic Anchoring and Memory Corrosion

The eMoT framework enhances multi-step reasoning in LLMs by treating reasoning as an evolving memory, stabilizing performance through symbolic computation and structured refinement.

S-SPPO: Semantic-Calibrated Self-Play Preference Optimization

S-SPPO introduces a dual-space semantic calibration framework to stabilize Self-Play Preference Optimization (SPPO), preventing policy degeneration when preference oracles assign overly confident wins to semantically similar responses.

Tail-Aware Adaptive-k: Query-Adaptive Context Selection for Retrieval-Augmented Generation

This paper proposes Tail-Aware Adaptive-k (TAA-k), a training-free framework for adaptive context selection in retrieval-augmented generation systems using Extreme Value Theory.

GigaSpeechBench: A Real-World Multilingual Speech-to-Text Benchmark

The paper introduces GigaSpeechBench, a multilingual, multidimensional benchmark for automatic speech recognition (ASR) and speech translation (AST) with 680 hours of human-annotated speech. It covers 12 low-resource languages, 6 Chinese dialects, 6 English accents, terminology-dense speech, older-adult and child speech, and human-annotated translations.

TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning

This paper proposes TRIAGE, a role-typed credit assignment framework for agentic reinforcement learning to address the structural incompleteness of standard GRPO.

Papers

cs.LG · cs.AI · Empirical · Recent · Jun 30, 2026

TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning

Yuanda Xu, Zhengze Zhou, Hejian Sang, Xiaomin Li +4 more

This paper proposes TRIAGE, a role-typed credit assignment framework for agentic reinforcement learning to address the structural incompleteness of standard GRPO.

eess.AS · Empirical · Recent