Deep Research as Rubric for Reinforcement Learning | ArxivCSExplorer