Reinforcement Learning with Robust Rubric Rewards