Heng Guo
3 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper proposes using GUI agents, both as objective evaluators and subjective playtesters, to significantly improve the generation of playable games from prompts, demonstrating a 66.8% rubric pass-rate with a novel iterative framework.
The paper introduces Pocket-Dentist, an efficiency-aware benchmark and model that demonstrates that compact, smaller Vision-Language Models (VLMs) can outperform larger models in accuracy while drastically reducing computational cost for on-device dental image understanding.
The paper demonstrates that audio-language models often ignore conflicting audio evidence in favor of text, and proposes a training-free decoding rule, GACL, that significantly improves faithfulness by correcting this arbitration bias.
Papers
Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models
Yichen Gao, Yiqun Zhang, Zijing Wang, Yujia Li +6 more
The paper demonstrates that audio-language models often ignore conflicting audio evidence in favor of text, and proposes a training-free decoding rule, GACL, that significantly improves faithfulness b…