An Extra Tree Regression Model for Discharge Coefficient Prediction: Novel, Practical Applications in the Hydraulic Sector and Future Research Directions

Hameed, Mohammed Majeed; AlOmar, Mohamed Khalid; Khaleel, Faidhalrahman; Al‐Ansari, Nadhir

doi:10.1155/2021/7001710

Cited by 46 publications

(25 citation statements)

References 55 publications

“…As the ELM approach presents excellent performance through the training phase, it is essential to validate this model using testing data points. Several studies emphasized that comparable models can be evaluated more effectively through the testing phase [28,74]. The reason is that the model in the training phase would be trained in the presence of input points and their corresponding values.…”

Section: Results (mentioning)

confidence: 99%

The Influence of Data Length on the Performance of Artificial Intelligence Models in Predicting Air Pollution

AlOmar

Khaleel

Al-Saadi

et al. 2022

Advances in Meteorology

Self Cite

Air pollution is one of humanity's most critical environmental issues and is considered contentious in several countries worldwide. As a result, accurate prediction is critical in human health management and government decision-making for environmental management. In this study, three artificial intelligence (AI) approaches, namely group method of data handling neural network (GMDHNN), extreme learning machine (ELM), and gradient boosting regression (GBR) tree, are used to predict the hourly concentration of PM2.5 over a Dorset station located in Canada. The investigation has been performed to quantify the effect of data length on the AI modeling performance. Accordingly, nine different ratios (50/50, 55/45, 60/40, 65/35, 70/30, 75/25, 80/20, 85/15, and 90/10) are employed to split the data into training and testing datasets for assessing the performance of applied models. The results showed that the data division significantly impacted the model's capacity, and the 60/40 ratio was found more suitable for developing predictive models. Furthermore, the results showed that the ELM model provides more precise predictions of PM2.5 concentrations than the other models. Also, a vital feature of the ELM model is its ability to adapt to the potential changes in training and testing data ratio. To summarize, the results reported in this study demonstrated an efficient method for selecting the optimal dataset ratios and the best AI model to predict properly which would be helpful in the design of an accurate model for solving different environmental issues.
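The nine training/testing ratios listed in the abstract can be sketched as follows. This is a minimal illustration, not the authors' procedure: the abstract does not say whether the split is chronological or random, so a chronological cut (earliest records for training) is assumed here, and the names `split_by_ratio`, `all_splits`, and `TRAIN_PERCENTAGES` are hypothetical.

```python
# Training percentages taken from the abstract: 50/50 through 90/10.
TRAIN_PERCENTAGES = [50, 55, 60, 65, 70, 75, 80, 85, 90]


def split_by_ratio(series, train_pct):
    """Split a sequence into a leading training part and a trailing testing part.

    Assumption: the cut is chronological (no shuffling).
    """
    if not 0 < train_pct < 100:
        raise ValueError("train_pct must be strictly between 0 and 100")
    cut = len(series) * train_pct // 100  # integer index of the first test record
    return series[:cut], series[cut:]


def all_splits(series):
    """Return a dict mapping labels such as '60/40' to (train, test) pairs."""
    return {
        f"{pct}/{100 - pct}": split_by_ratio(series, pct)
        for pct in TRAIN_PERCENTAGES
    }


if __name__ == "__main__":
    hourly_values = list(range(200))  # placeholder data, not real PM2.5 readings
    for label, (train, test) in all_splits(hourly_values).items():
        print(label, len(train), len(test))
```

Each candidate model would then be fitted on the training part and scored on the testing part for every ratio, which is how a ratio such as 60/40 could be compared against the others.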

Section: Results (mentioning)

confidence: 99%

“…Machine learning (ML) has made tremendous progress in recent years in solving numerous engineering in general [27][28][29][30][31][32] and PM 2.5 concentration in particular [33][34][35][36][37][38][39][40][41][42]. ML combines data science, statistics, and computing in an interdisciplinary fashion.…”

Section: Previous Work (mentioning)

confidence: 99%

The Influence of Data Length on the Performance of Artificial Intelligence Models in Predicting Air Pollution

AlOmar

Khaleel

Al-Saadi

et al. 2022

Advances in Meteorology

Self Cite

“…These include 52 specimens from Lu [36], 37 specimens from Ludwig and Nunes [37], 25 specimens from Hameed et al [38], 53 specimens from Hameed et al [39], 12 specimens from Naser and Alavi [40], 12 specimens from Ludwig et al [41], 6 specimens by Nguyen et al [42], 12 specimens by Yaseen et al [43], 19 specimens by Zhang et al [44], 39 specimens by Gong, and 4 specimens by Gandomi et al [45]. It is worth noting that the database contains a wide range of RC deep beams to improve the generativity of the model.…”

Section: Shear Strength of RC Deep Beams and Data Collection (mentioning)

confidence: 99%

Optimising the Selection of Input Variables to Increase the Predicting Accuracy of Shear Strength for Deep Beams

et al. 2022

Self Cite

The deep beam in load transfer is very important as well as difficult to design due to its shear stress problems. Accurate estimation of shear stress would help engineers to get a safer design. One of the major obstacles in building an accurate prediction model is optimising the input variables. Therefore, developing an efficient algorithm to select the optimal input parameters that have the highest information content to represent the target and minimise redundant data is very important. The feature-section algorithm based on the combination of genetic algorithm and information theory (GAITH) was used to select the most important input combinations and introduce them into the prediction models. Four models were used in this study: locally weighted linear regression (LWLR) based on the radial basis kernel function, multiple linear regression (MLR), extreme learning machine (ELM), and random forest (RF). The study found that all applied models were significantly improved by the presence of the GAITH algorithm, except for the MLR model. The LWLR-GAITH model showed 29.15% to 47.88% higher performance accuracy in terms of root mean square error (RMSE) than the other hybrid models during the test phase. Moreover, the results of the standard models (without using the GAITH algorithm) proved the superiority of the LWLR model in reducing the RMSE by 34.51%, 55.17%, and 35.35% compared to RF, MLR, and ELM, respectively. Thus, the inclusion of the LWLR model with GAITH has demonstrated a reliable and applicable computer aid for modelling shear strength in deep beams.

“…In this study, the extra-trees method will be used to forecast time-series data, namely the daily closing stock price with the basic autoregressive model. Several studies related to extra-tree decision trees such as the highly random decision tree method (Geurts, et al, 2006), extra-trees regression model for prediction of discharge coefficient in the hydraulic sector (Hameed, et al, 2021), as well as an ensemble cascading extremely randomized tree approach for short-term traffic flow prediction (Zhang, et al, 2019). Several other studies used extra trees and several other tree-based methods as a comparison in modeling phenomena such as prediction of daily precipitation and temperature (Jose, et al, 2022) and prediction of blood cancer (Rupapara, et al, 2022).…”

Section: Figure 1 Example of a Decision Tree (mentioning)

confidence: 99%

Extra Trees Method for Stock Price Forecasting With Rolling Origin Accuracy Evaluation

Mahkya

Notodiputro

Sartono

2022

Medstat

Stock is an investment instrument that has risk in its management. One effort to minimize this risk is to model and make further forecasts of stock price movements. Time series data forecasting with autoregressive models is often found in several cases with the most popular approach being the ARIMA model. The tree-based method is one of the algorithms that can be used to forecast both in classification and regression. One ensemble approach to tree-based methods is Extra Trees. This study aims to forecast using the Extra Trees algorithm by evaluating forecasting accuracy with Rolling Forecast Origin on BRMS stock price data. Based on the results obtained, it is known that Extra Trees produces a fairly good accuracy for forecasting up to 6 days after training data with a MAPE of less than 0.1%.
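The rolling forecast origin evaluation mentioned in the abstract can be sketched roughly as below. This is an illustration under stated assumptions, not the study's implementation: a naive last-value forecaster stands in for the Extra Trees model, the training window is assumed to expand by one observation per step, and the function names (`mape`, `naive_forecast`, `rolling_origin_mape`) are hypothetical.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, expressed in percent."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


def naive_forecast(history, horizon):
    """Placeholder model: repeat the last observed value for every step ahead.

    Assumption: this stands in for a fitted Extra Trees autoregressive model.
    """
    return [history[-1]] * horizon


def rolling_origin_mape(series, initial_train, horizon, forecast=naive_forecast):
    """Score a forecaster at successive origins, moving the origin forward by one.

    At each origin the model sees series[:origin] and is scored on the
    next `horizon` observations.
    """
    scores = []
    origin = initial_train
    while origin + horizon <= len(series):
        history = series[:origin]
        actual = series[origin:origin + horizon]
        scores.append(mape(actual, forecast(history, horizon)))
        origin += 1
    return scores


if __name__ == "__main__":
    prices = [100.0, 101.0, 99.5, 102.0, 103.5, 103.0, 104.0, 105.5]  # made-up values
    print(rolling_origin_mape(prices, initial_train=4, horizon=2))
```

Averaging the per-origin scores gives a single accuracy figure for a given horizon; repeating this for horizons of 1 to 6 days would mirror the kind of comparison the abstract describes.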

An Extra Tree Regression Model for Discharge Coefficient Prediction: Novel, Practical Applications in the Hydraulic Sector and Future Research Directions

Abstract: Despite modern advances used to estimate the discharge coefficient (Cd), it is still a major challenge for hydraulic engineers to accurately determine …
