Christian Scherer
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories
ML × 1
Research Timeline
2026
Coherent Off-Policy Improvement of Large Behavior Models with Learned Rewards
The paper proposes a coherent inverse reinforcement learning (IRL) method to improve large behavior models for robotic control, achieving superior sample efficiency and performance on complex sparse manipulation tasks compared to traditional RL baselines.
Papers
cs.LG | Recent | Jun 1, 2026
Coherent Off-Policy Improvement of Large Behavior Models with Learned Rewards
Christian Scherer, Joe Watson, Theo Gruner, Daniel Palenicek +2 more