2011
DOI: 10.2202/1559-0410.1367
|View full text |Cite
|
Sign up to set email alerts
|

Using Tree Ensembles to Analyze National Baseball Hall of Fame Voting Patterns: An Application to Discrimination in BBWAA Voting

Abstract: We predict the induction of Major League Baseball hitters and pitchers into the National Baseball Hall of Fame by the Baseball Writers' Association of America. We employ a Random Forest algorithm for binary classification, improving upon past models with a simplistic input approach. Our results suggest that the random forest technique is a fruitful line of research with prediction in the sports world. We find an error rate as low as 0.91% in our most accurate forest, with no out-of-bag Error higher than 2.6% i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…Furthermore, tree-based ensemble methods have already been applied in MLB research. Mills & Salaga (2011) and Freiman (2010) predict the election of hitters and pitchers into the National Hall of Fame by the Baseball Writers' Association by RF classification and Swartz et al (2017) estimate pitch quality by RF regression.…”
Section: Predicting Game Attendance and Determinants Of Demandmentioning
confidence: 99%
“…Furthermore, tree-based ensemble methods have already been applied in MLB research. Mills & Salaga (2011) and Freiman (2010) predict the election of hitters and pitchers into the National Hall of Fame by the Baseball Writers' Association by RF classification and Swartz et al (2017) estimate pitch quality by RF regression.…”
Section: Predicting Game Attendance and Determinants Of Demandmentioning
confidence: 99%
“…Some studies addressed the topic of predicting future baseball HOFers, such as Kaufman and Kaufman (1996), Findlay and Reid (2002), Cohen (2004), Mills andSalaga (2011), andZemler (2013). First, Kaufman and Kaufman (1996) used a multiple regression analysis to predict pitchers' HOFers, founding that 60 to 73% of the variance in the pitcher's HOF selection could be explained by the predictors selected.…”
Section: Prediction Models To Simulatementioning
confidence: 99%
“…While the results from these studies are sensitive to model specification, there is evidence of racial and ethnic bias in the HOF process (Findlay and Reid 1997; Desser, Monks and Robinson 1999; Jewell, Brown and Miles 2002). However, racial and ethnic bias does not appear to delay induction into the HOF nor alter the composition of the HOF (Findlay and Reid 1997; Jewell 2003; Mills and Salaga 2011).…”
Section: Introductionmentioning
confidence: 97%