X4Val: Learning Neural Surrogates for Variance-Reduced Policy Evaluation | ArxivCSExplorer