On the Optimizer Dependence of Neural Scaling Laws | ArxivCSExplorer