Hardy Chen

2 indexed papers

Top categories

AI ×2, Crypto ×1, NLP ×1

Frequent co-authors

Yuyin Zhou (2×)

Junqi Liu (1×)

Salena Song (1×)

Yuhan Wang (1×)

Jiawei Mao (1×)

Xiaoke Huang (1×)

Research Timeline

2026

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

This paper conducts the first real-world safety evaluation of the personal AI agent OpenClaw, demonstrating that its broad system access creates inherent vulnerabilities that significantly increase the attack success rate regardless of the underlying large language model.

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

The paper introduces AutoMedBench, a novel workflow-aware benchmark that evaluates autonomous medical-AI agents across a five-stage research process, revealing that agents struggle most with validation and submission.

Papers

cs.AI · Recent · Jun 1, 2026

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

Junqi Liu, Salena Song, Yuhan Wang, Jiawei Mao +11 more

cs.CR, cs.AI, cs.CL · Recent · Apr 6, 2026