Abstract:Abstract. In this paper we continue study of the games of prediction with expert advice with uncountably many experts. A convenient interpretation of such games is to construe the pool of experts as one "stochastic predictor", who chooses one of the experts in the pool at random according to the prior distribution on the experts and then replicates the (deterministic) predictions of the chosen expert. We notice that if the stochastic predictor's total loss is at most L with probability at least p then the lear… Show more
“…The proof of this theorem is given in Section 5. Note that while the algorithm may seem to be designed for the stochastic setting, we apply it to the pure adversarial case 9 and obtain the loss guarantees. At the same time, the adversarial loss bound (5) depends on the probability distribution p(•) for which the algorithm is designed.…”
Section: Guarantees Of Performancementioning
confidence: 99%
“…We consider the Decision-Theoretic Online Learning (DTOL) framework [1,2,3,4,5,3] which is closely related to the paradigm of prediction with expert advice [6,7,8,9,1,10,11,12]. A master algorithm at every step t = 1, .…”
The article is devoted to investigating the application of hedging strategies to online expert weight allocation under delayed feedback. As the main result we develop the General Hedging algorithm G based on the exponential reweighing of experts' losses. We build the artificial probabilistic framework and use it to prove the adversarial loss bounds for the algorithm G in the delayed feedback setting. The designed algorithm G can be applied to both countable and continuous sets of experts. We also show how algorithm G extends classical Hedge (Multiplicative Weights) and adaptive Fixed Share algorithms to the delayed feedback and derive their regret bounds for the delayed setting by using our main result.
“…The proof of this theorem is given in Section 5. Note that while the algorithm may seem to be designed for the stochastic setting, we apply it to the pure adversarial case 9 and obtain the loss guarantees. At the same time, the adversarial loss bound (5) depends on the probability distribution p(•) for which the algorithm is designed.…”
Section: Guarantees Of Performancementioning
confidence: 99%
“…We consider the Decision-Theoretic Online Learning (DTOL) framework [1,2,3,4,5,3] which is closely related to the paradigm of prediction with expert advice [6,7,8,9,1,10,11,12]. A master algorithm at every step t = 1, .…”
The article is devoted to investigating the application of hedging strategies to online expert weight allocation under delayed feedback. As the main result we develop the General Hedging algorithm G based on the exponential reweighing of experts' losses. We build the artificial probabilistic framework and use it to prove the adversarial loss bounds for the algorithm G in the delayed feedback setting. The designed algorithm G can be applied to both countable and continuous sets of experts. We also show how algorithm G extends classical Hedge (Multiplicative Weights) and adaptive Fixed Share algorithms to the delayed feedback and derive their regret bounds for the delayed setting by using our main result.
“…In this section we discuss basic aggregating algorithms for 1-step-ahead forecasting based on exponential reweighing. Our framework is built on the general aggregating algorithm G 1 by Vovk (1999), we discuss it in Subsection 3.1. The simplest and earliest version V 1 by Vovk (1998) of this algorithm is discussed in Subsection 3.2.…”
Section: Aggregating Algorithm For 1-step-ahead Forecastingmentioning
confidence: 99%
“…In this work, we investigate the problem of modifying aggregating algorithms based on exponential reweighing for the long-term forecasting. We consider the general aggregating algorithm by Vovk (1999) for the 1-step-ahead forecasting and provide its reasonable nonreplicated generalization for the D-th-step-ahead forecasting. These algorithms are denoted by G 1 and G D respectively.…”
Section: Introductionmentioning
confidence: 99%
“…In Section 3 we discuss the aggregating algorithms for the 1-step-ahead forecasting. In Subsection 3.1 we describe the general model G 1 by Vovk (1999) and consider its special case V 1 in Subsection 3.2.…”
The article is devoted to investigating the application of aggregating algorithms to the problem of the long-term forecasting. We examine the classic aggregating algorithms based on the exponential reweighing. For the general Vovk's aggregating algorithm we provide its generalization for the long-term forecasting. For the special basic case of Vovk's algorithm we provide its two modifications for the long-term forecasting. The first one is theoretically close to an optimal algorithm and is based on replication of independent copies. It provides the time-independent regret bound with respect to the best expert in the pool. The second one is not optimal but is more practical and has O( √ T ) regret bound, where T is the length of the game.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.