A regression tree (RT) was extensively utilized in quantitative structure-activity relationship studies (QSAR), due to its inherently promising attributes. The issues of instability and inclination to overfitting and suboptima, however, often occur in RT. In the present study, a robust version of boosting was invoked to simultaneously improve the stability and generalization ability of RT, forming a new method called robust boosting regression tree (RBRT). RBRT works by sequentially employing the RT method to model the robustly reweighted versions of the original training set and then aggregating these resultant predictors via weighted median. The designed RBRT was applied to predict the bioactivities of flavoniod derivatives and the anti-HIV activities of HIV-1 inhibitors, compared with boosting RT (BRT) and RT. The results of these two data sets demonstrated that the introduction of robust boosting drastically enhances the stability and generalization ability of RT, and RBRT is superior to BRT in QSAR studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.