2021
DOI: 10.3390/polym13111898
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model

Abstract: We propose a chemical language processing model to predict polymers’ glass transition temperature (Tg) through a polymer language (SMILES, Simplified Molecular Input Line Entry System) embedding and recurrent neural network. This model only receives the SMILES strings of a polymer’s repeat units as inputs and considers the SMILES strings as sequential data at the character level. Using this method, there is no need to calculate any additional molecular descriptors or fingerprints of polymers, and thereby, bein… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
38
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

3
6

Authors

Journals

citations
Cited by 56 publications
(39 citation statements)
references
References 81 publications
0
38
0
1
Order By: Relevance
“…We have implemented a 2D CNN model based on images and revealed its poor generalization ability for T g prediction in the recent study . Besides this, we have also implemented the RNN model that is purely linguistic-based using the SMILES notation of a repeat unit as input . We re-evaluate our previously trained models using the new 566 MD simulations and compare them here with other models.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We have implemented a 2D CNN model based on images and revealed its poor generalization ability for T g prediction in the recent study . Besides this, we have also implemented the RNN model that is purely linguistic-based using the SMILES notation of a repeat unit as input . We re-evaluate our previously trained models using the new 566 MD simulations and compare them here with other models.…”
Section: Resultsmentioning
confidence: 99%
“…In our case, we treat the SMILES notation of polymers as the input sequence to the RNN model and then specify the T g as the output token to be predicted. Our RNN model is realized and introduced in our recent study …”
Section: Datasets Models and Methodsmentioning
confidence: 99%
“…The integration of CGMD/ML is able to speed up the prediction of polymer properties at the chain level. Researchers have adopted ML models for the prediction of polymer properties mostly based on their monomer representation ( Ramprasad and Kim, 2019 ; Sattari et al, 2021 ; Chen et al, 2021b ; Gracheva et al, 2021 ), ignoring the influence of polymer chains, such as molecular weight, topology ( Tao et al, 2021a ), and copolymer sequence ( Kuenneth et al, 2021 ). Particularly for novel polymeric materials, there are limitations in the existing database due to unexplored chemical space ( Wilbraham et al, 2019 ).…”
Section: Application Of ML For Understanding and Design Of Polymer Chainsmentioning
confidence: 99%
“…Compared to the fully connected layers, the number of hidden neurons is drastically reduced by conventional and pool layers, thus allowing for far deeper networks. NNs have been applied to build ML models for the suitable selection of polymer-solvent pairs [99], glass transmission temperature [100] and thermal conductivity [25,77,95]. For the suitable selection of polymer-solvent pairs, a total of 11,958 polymer + good-solvent pairs and 8469 polymer + nonsolvent pairs were employed to train a binary classification NN model to judge whether a solvent is good or insoluble for a specific polymer [99].…”
Section: Algorithmmentioning
confidence: 99%