Zilong Zheng

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×1AI×1Info Retrieval×1

Frequent co-authors

Jiaqi Li2×

Xiaobo Wang1×

Tong Wu1×

Min Tang1×

Qi Liu1×

Zhixin Cai1×

Research Timeline

2026

Xetrieval: Mechanistically Explaining Dense Retrieval

Xetrieval introduces an embedding-level framework to mechanistically explain dense retrieval decisions by decomposing high-dimensional embeddings into sparse, human-interpretable features.

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

The paper introduces SAVE, a framework that uses on-policy feedback and the value function to self-supervise and improve reward models, significantly enhancing RLHF performance across multiple benchmarks.

Highlighted terms show continued research focus across papers

Papers

cs.CLRecentMay 29, 2026

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Xiaobo Wang, Tong Wu, Min Tang, Jiaqi Li +2 more

View →

cs.AIcs.IRRecentMay 28, 2026