Hui Wu
3 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper proposes a novel identity-based public key management framework, IPK-pq, utilizing NIST ML-DSA and random matrix theory to enhance the scalability and efficiency of Public Key Infrastructure (PKI) for large-scale, post-quantum environments.
The paper addresses the reliability of open-weight LLMs for power system code generation by identifying structured API-knowledge boundary errors and proposing a boundary-aware intervention that significantly boosts accuracy without fine-tuning.
The paper introduces OpenWebRL, an open framework that enables training visual web agents using online multi-turn Reinforcement Learning directly on live websites, achieving state-of-the-art performance on challenging web benchmarks.
Papers
OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents
Rui Yang, Qianhui Wu, Yuxi Chen, Hao Bai +6 more
The paper introduces OpenWebRL, an open framework that enables training visual web agents using online multi-turn Reinforcement Learning directly on live websites, achieving state-of-the-art performan…