Kun Zhan

2 indexed papers

Top categories

NLP ×2, Info Retrieval ×1, AI ×1, ML ×1

Frequent co-authors

OneRec Team (1×)

Biao Yang (1×)

Boyang Ding (1×)

Chenglong Chu (1×)

Dunju Zang (1×)

Fei Pan (1×)

Research Timeline

2026

HMPO: Hybrid Median-length Policy Optimization for Chain-of-Thought Compression

HMPO introduces a single-stage, cost-effective reinforcement learning framework that achieves significant token compression of Chain-of-Thought reasoning with minimal loss of accuracy, applicable across various large language model architectures.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

Papers

cs.IR, cs.AI, cs.CL · Recent · Jun 4, 2026

OneReason Technical Report

OneRec Team, Biao Yang, Boyang Ding, Chenglong Chu +80 more

cs.LG, cs.CL · Recent · Jun 1, 2026