2021
DOI: 10.1007/s11036-020-01699-w
|View full text |Cite
|
Sign up to set email alerts
|

MapReduce-Based Improved Random Forest Model for Massive Educational Data Processing and Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 25 publications
0
4
0
Order By: Relevance
“…For instance, the idea of creating GBT to analyze and predict student performance was recently applied to the Brazilian public school system. The study proved that internal grades and number of absences are some of the major attributes that helped to generate good accuracy of the predicting model [45]. In addition, to forecast the dropout ratio of registered students for online and on-campus studies, the gradient boosted decision tree model has been applied several times and successfully proved its prediction capability [46][47][48].…”
Section: Gradient Boosted Treesmentioning
confidence: 88%
See 1 more Smart Citation
“…For instance, the idea of creating GBT to analyze and predict student performance was recently applied to the Brazilian public school system. The study proved that internal grades and number of absences are some of the major attributes that helped to generate good accuracy of the predicting model [45]. In addition, to forecast the dropout ratio of registered students for online and on-campus studies, the gradient boosted decision tree model has been applied several times and successfully proved its prediction capability [46][47][48].…”
Section: Gradient Boosted Treesmentioning
confidence: 88%
“…By building multiple trees, the approach minimizes the variance and typically provides more accurate results [44]. This method was applied several times on the educational dataset dealing with different problems [13,45,46]. Therefore, the separation of multiple attributes using splitting rules make it easy to understand and useful for prediction purposes.…”
Section: Random Forestmentioning
confidence: 99%
“…Mu et al [19] introduced the Pearson correlation coefficient to determine the optimal split attribute and split point during decision tree growth and trained decision trees in parallel using MapReduce technology. Xu W et al [20] calculated the information gain of various features on MapReduce computing framework by introducing a feature weighting system and improving existing data analysis with evaluation metrics. Chen et al [21] combined dataparallel and task-parallel optimization methods to reduce communication costs between data and workload imbalances.…”
Section: Introductionmentioning
confidence: 99%
“…Supervised classification is widely used in almost every area nowadays. Its application in medicine [1][2][3], agriculture [4][5][6], education [7][8][9][10], sports [11][12][13], and social behavior in business [14][15][16], among other areas [17][18][19], is indisputable. However, it is well known that there is no classifier having an overall superior performance to the remaining ones for all problems.…”
Section: Introductionmentioning
confidence: 99%