Yong Wang
4 indexed papers
Research Timeline
The paper proposes FreeUp, a frequency-decoupled framework that improves encrypted network anomaly detection by separately modeling and fusing low- and high-frequency components of traffic data.
The paper investigates whether using fine-grained, tensorized adapters (CP components) instead of standard LoRA ranks improves the accuracy-budget trade-off in PEFT, finding that while they fill budget gaps, the benefit is highly task-dependent and does not guarantee superior performance.
The paper proposes Cross-Layer Sparse Attention (CLSA) to significantly improve the efficiency and accuracy of long-context LLMs by jointly optimizing KV-cache sharing and the routing index across decoder layers.
The paper proposes Agentic Procedural Policy Optimization (APPO), an agentic reinforcement learning method that improves tool-use capabilities by assigning credit to fine-grained decision points.
Papers
APPO: Agentic Procedural Policy Optimization
Xucong Wang, Ziyu Ma, Yong Wang, Yuxiang Ji +4 more
This paper proposes a new method for agentic Reinforcement Learning called Agentic Procedural Policy Optimization (APPO) that improves tool-use capabilities by assigning credit to fine-grained decisio…