Xiaohan Wang

3 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×2AI×1Crypto×1

Frequent co-authors

Jiajun Chai2×

Wei Lin2×

Guojun Yin2×

Yaocheng Zhang1×

Yuqian Fu1×

Songjun Tu1×

Research Timeline

2026

On the Hidden Costs of Counterfactual Knowledge Training in LLM Unlearning

This paper analyzes the limitations of Counterfactual Knowledge Training (CFT) for LLM unlearning, identifying knowledge conflict and hallucination spillover as major pitfalls that hinder its effectiveness.

ZipRL: Adaptive Multi-Turn Context Compression with Hindsight Response Replay

ZipRL introduces an adaptive context compression framework that significantly improves the performance and efficiency of LLMs in complex, multi-turn agent tasks by combining multi-granularity compression with Hindsight Response Replay.

Are Full Rollouts Necessary for On-Policy Distillation?

This paper proposes two horizon-control strategies, Progressive OPD (POPD) and Truncated OPD (TOPD), demonstrating that full rollouts are often unnecessary for On-Policy Distillation, leading to significant improvements in training efficiency.

Highlighted terms show continued research focus across papers

Papers

cs.CLRecentMay 29, 2026

Are Full Rollouts Necessary for On-Policy Distillation?

Yaocheng Zhang, Jiajun Chai, Yuqian Fu, Songjun Tu +6 more

View →

cs.AIRecentMay 27, 2026