Croissant Tasks: A Metadata Format for Reproducible Machine Learning Evaluations