Jiacheng Liang

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×2AI×2NLP×1ML×1

Frequent co-authors

Yuhui Wang1×

Tanqiu Jiang1×

Charles Fleming1×

Ting Wang1×

Yao Ma1×

Tharindu Kumarage1×

Research Timeline

2026

ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System

ARES is a novel framework that systematically discovers and mitigates dual vulnerabilities in RLHF systems by simultaneously testing the core LLM and its Reward Model (RM) using structured adversarial prompts, leading to enhanced safety robustness.

MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory

The paper introduces MAGE, a novel defensive framework that uses a dedicated 'shadow memory' to proactively detect and mitigate long-horizon threats against LLM agents during complex, multi-step interactions.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.CLRecentMay 4, 2026

MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory

Yuhui Wang, Tanqiu Jiang, Jiacheng Liang, Charles Fleming +1 more

View →

cs.AIcs.CRcs.LGRecentApr 20, 2026