“…In recent years, the data‐based models, which feature little dependence on a priori knowledge and capability of capturing the characteristics of sophisticated process, have gained more and more attention in the prediction of chemical process outputs . These data‐based prediction models are mainly derived from nine base models, that is, multiple linear regression, partial least squares, principal component regression, artificial neural network, TREE, K nearest neighbors, support vector regression (SVR), Gaussian process regression (GPR), and random forest (RF) . According to Kadlec et al and Harrington et al, each type of the base model has its own merits, whereas no one is definitely superior to another and able to exactly capture the global characteristics of practical processes.…”