XLGoBench: Detecting cross-lingual skill gaps with algorithmic tasks | ArxivCSExplorer