“…Different from early bounds that often rely on the complexity measures of the considered function classes, the recent PAC-Bayes bounds (McAllester, 1999;Seeger, 2002;Langford, 2005) give the tightest predictions of the generalization performance, for which the prior and posterior distributions of learners are involved on top of the PAC (Probably Approximately Correct) learning setting (Catoni, 2007;Germain et al, 2009). Beyond the common supervised learning, PAC-Bayes analysis has also been applied to other tasks, e.g., density estimation (Seldin and Tishby, 2010;Higgs and Shawe-Taylor, 2010) and reinforcement learning (Seldin et al, 2012).…”