Polymer band gap is one of the most important properties associated with electric conductivity. In this work, the machine learning model called support vector regression (SVR) was developed to predict the polymer band gap, where the training data of the polymer band gap were obtained from DFT computation while the descriptors were generated from Dragon. After feature selection with the maximum relevance minimum redundancy, the SVR model using 16 key features as inputs gave the optimal performance for predicting polymer band gaps. The determination coefficient (R 2 ) of the SVR model between the DFT computations and SVR predictions of polymer band gaps reached as high as 0.824 for the leave-one-out cross-validation and 0.925 for the independent test. Besides, the 16 key features were explored through correlation analysis and sensitivity analysis. The available model can be used to screen out the polymers with targeted band gaps before experiments, which is very helpful for rapid design of new polymers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.