Liang Wu

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×2NLP×2ML×2Vision×1Multimedia×1

Frequent co-authors

Xinhao Song1×

Su Su1×

Sirui Song1×

Hongliang Wu1×

Wen Shen1×

Zhihua Wei1×

Research Timeline

2026

PhoneWorld: Scaling Phone-Use Agent Environments

The paper introduces PhoneWorld, a scalable pipeline that automatically converts real-world GUI trajectories and screenshots into controllable, reproducible phone-use environments, significantly improving agent performance across multiple mobile benchmarks.

HLL: Can Agents Cross Humanity's Last Line of Verification?

The paper introduces HLL, a benchmark that tests if multimodal agents can successfully substitute for human verification (like CAPTCHA) in complex, real-world workflows, finding that current agents are still brittle and fail under realistic conditions.

Highlighted terms show continued research focus across papers

Papers

cs.AIcs.CLcs.CVRecentJun 1, 2026

HLL: Can Agents Cross Humanity's Last Line of Verification?

Xinhao Song, Su Su, Sirui Song, Hongliang Wu +5 more

View →

cs.CLcs.AIcs.LGRecentMay 28, 2026