Haokai Ma
3 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
SnapGuard proposes a lightweight, multimodal method to detect prompt injection attacks in screenshot-based web agents by analyzing visual stability and contrast-polarity textual signals, achieving high accuracy with significantly reduced computational overhead.
The paper proposes TRACE, a trajectory risk-aware compression method, to effectively aggregate sparse and delayed safety evidence across long agent trajectories, achieving state-of-the-art performance on multiple safety benchmarks.
ResMerge proposes a residual-based spectral merging framework that improves the combination of multiple reinforcement learning (RL) expert models by stabilizing the aggregation process using a residual backbone.
Papers
ResMerge: Residual-based Spectral Merging of Large Language Models
Yandu Sun, Zhiyan Hou, Haokai Ma, Yuheng Jia +5 more
ResMerge proposes a residual-based spectral merging framework that improves the combination of multiple reinforcement learning (RL) expert models by stabilizing the aggregation process using a residua…