Zhezheng Hao

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×2Multiagent×1

Frequent co-authors

Ziyan Liu2×

Hong Wang2×

Yeqiu Chen1×

Jingren Hou1×

Ruiyi Ding1×

Yongkang Yang1×

Research Timeline

2026

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

The paper introduces Metacognitive Memory Policy Optimization (MMPO), a novel memory training approach that optimizes LLM memory not based on final task success, but on minimizing epistemic uncertainty in intermediate summaries, significantly improving long-horizon agent performance.

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

The paper proposes Meta-Team, an experience-driven framework that enables multi-agent systems (MAS) to collaboratively self-evolve by transforming complex execution experiences into reusable improvements for agent behaviors and coordination.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 28, 2026

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

Ziyan Liu, Zhezheng Hao, Yeqiu Chen, Hong Wang +6 more

View →

cs.MAcs.AIRecentMay 28, 2026