Certified Policy Optimisation for Nested Causal Bandits via PAC-Bayes Risk | ArxivCSExplorer