Shuo Yang

1 indexed paper

Top categories

NLP ×1, ML ×1

Frequent co-authors

Zheyu Zhang (1×)

Gjergji Kasneci (1×)

Research Timeline

2026

Consolidating Rewarded Perturbations for LLM Post-Training

The paper introduces CoRP, a gradient-free operator that consolidates the benefits of ensemble-based post-training methods into a single, deployable model update, significantly improving performance with minimal computational overhead.

Papers

cs.CL · cs.LG · Recent · May 29, 2026

Consolidating Rewarded Perturbations for LLM Post-Training

Zheyu Zhang, Shuo Yang, Gjergji Kasneci
