Jiajun Chai

2 indexed papers

Top categories

NLP ×1, AI ×1

Frequent co-authors

Xiaohan Wang (2×)

Wei Lin (2×)

Guojun Yin (2×)

Yaocheng Zhang (1×)

Yuqian Fu (1×)

Songjun Tu (1×)

Research Timeline

2026

ZipRL: Adaptive Multi-Turn Context Compression with Hindsight Response Replay

ZipRL introduces an adaptive context compression framework that significantly improves the performance and efficiency of LLMs in complex, multi-turn agent tasks by combining multi-granularity compression with Hindsight Response Replay.

Are Full Rollouts Necessary for On-Policy Distillation?

This paper proposes two horizon-control strategies, Progressive OPD (POPD) and Truncated OPD (TOPD), and shows that full rollouts are often unnecessary for On-Policy Distillation, which significantly improves training efficiency.

Papers

cs.CL · Recent · May 29, 2026

Are Full Rollouts Necessary for On-Policy Distillation?

Yaocheng Zhang, Jiajun Chai, Yuqian Fu, Songjun Tu +6 more

cs.AI · Recent · May 27, 2026