Zhipeng Wang
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
This paper analyzes the x402 agentic payment protocol, demonstrating through five concrete, practical attacks that it is vulnerable across multiple stages of its payment workflow.
S-SPPO introduces a dual-space semantic calibration framework to stabilize Self-Play Preference Optimization (SPPO), preventing policy degeneration when preference oracles assign overly confident wins to semantically similar responses.
Papers
S-SPPO: Semantic-Calibrated Self-Play Preference Optimization
Xiwen Chen, Wenhui Zhu, Jingjing Wang, Peijie Qiu +12 more
S-SPPO introduces a dual-space semantic calibration framework to stabilize Self-Play Preference Optimization (SPPO), preventing policy degeneration when preference oracles assign overly confident wins…