2014
DOI: 10.1016/j.ijforecast.2013.07.005
|View full text |Cite
|
Sign up to set email alerts
|

A gradient boosting approach to the Kaggle load forecasting competition

Abstract: We describe and analyse the approach used by Team TinTin (Souhaib Ben Taieb and Rob J Hyndman) in the Load Forecasting track of the Kaggle Global Energy Forecasting Competition 2012. The competition involved a hierarchical load forecasting problem for a US utility with 20 geographical zones. The available data consisted of the hourly loads for the 20 zones and hourly temperatures from 11 weather stations, for four and a half years. For each zone, the hourly electricity load for nine different weeks needed to b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
116
0
4

Year Published

2015
2015
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 236 publications
(120 citation statements)
references
References 20 publications
0
116
0
4
Order By: Relevance
“…In TC, organizations post their problems to IT-mediated crowds on platforms such as Innocentive, Eyeka, and Kaggle (Afuah & Tucci 2012) or through in-house platforms such as Challenge.gov (Brabham 2013b). These platforms generally attract and maintain more or less specialized crowds premised on the platform's specific focus; for example, Eyeka's crowd creates advertising collateral for brands, while the crowd at Kaggle focuses on data science solutions (Ben Taieb & Hyndman 2013;Roth & Kimani 2013). When applied to innovation, these platforms have been termed open innovation platforms (Sawhney et al 2003), and represent both the idea generation and problem solving aspects of crowdsourcing (Brabham 2008;Morgan & Wang 2010).…”
Section: Tournament Crowdsourcing (Tc)mentioning
confidence: 99%
“…In TC, organizations post their problems to IT-mediated crowds on platforms such as Innocentive, Eyeka, and Kaggle (Afuah & Tucci 2012) or through in-house platforms such as Challenge.gov (Brabham 2013b). These platforms generally attract and maintain more or less specialized crowds premised on the platform's specific focus; for example, Eyeka's crowd creates advertising collateral for brands, while the crowd at Kaggle focuses on data science solutions (Ben Taieb & Hyndman 2013;Roth & Kimani 2013). When applied to innovation, these platforms have been termed open innovation platforms (Sawhney et al 2003), and represent both the idea generation and problem solving aspects of crowdsourcing (Brabham 2008;Morgan & Wang 2010).…”
Section: Tournament Crowdsourcing (Tc)mentioning
confidence: 99%
“…In a small daily load forecasting study, reference [5] compare four different methods to deal with public holidays were evaluated in a neural net framework. First, an additional holiday dummy (A1), second the holidays were treated as a Sunday (A2), third an additional factor variable is introduced (similarly as in [52]) was used to code the day before the holiday, the holiday itself, and the day after the holiday (A3) and fourth a methodology (A2) and (A3) combined. Reference [5] concludes that the method A3 performs best.…”
Section: Results For Forecasting Accuracymentioning
confidence: 99%
“…Most notable these are fuzzy linear regression approaches [10,49,50], such as factor variable approches as used in [51,52].…”
Section: Public Holidays With Separate Additional Dummies and Weekdaymentioning
confidence: 99%
“…In the latest competitions of energy forecasting, most of team entries tended to employ ensemble techniques, such as gradient boosting machines (GBMs) and quantile regression forests (QRFs), using more input features [4,5]. A multi-step forecasting strategy with component-wise gradient boosting [6] was used. In addition, nonparametric approaches had been used to do probabilistic forecasting problem, such as Gaussian process [7].…”
Section: Introductionmentioning
confidence: 99%