Christian Scherer

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

ML×1

Frequent co-authors

Joe Watson1×

Theo Gruner1×

Daniel Palenicek1×

Ingmar Posner1×

Jan Peters1×

Research Timeline

2026

Coherent Off-Policy Improvement of Large Behavior Models with Learned Rewards

The paper proposes a coherent inverse reinforcement learning (IRL) method to improve large behavior models for robotic control, achieving superior sample efficiency and performance on complex sparse manipulation tasks compared to traditional RL baselines.

Highlighted terms show continued research focus across papers

Papers

cs.LGRecentJun 1, 2026

Coherent Off-Policy Improvement of Large Behavior Models with Learned Rewards

Christian Scherer, Joe Watson, Theo Gruner, Daniel Palenicek +2 more

View →