Efficient Exploration for Iterative Nash Preference Optimization