2023
DOI: 10.1038/s41524-023-01016-5
|View full text |Cite
|
Sign up to set email alerts
|

TransPolymer: a Transformer-based language model for polymer property predictions

Abstract: Accurate and efficient prediction of polymer properties is of great significance in polymer design. Conventionally, expensive and time-consuming experiments or simulations are required to evaluate polymer functions. Recently, Transformer models, equipped with self-attention mechanisms, have exhibited superior performance in natural language processing. However, such methods have not been investigated in polymer sciences. Herein, we report TransPolymer, a Transformer-based language model for polymer property pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
59
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 55 publications
(60 citation statements)
references
References 57 publications
1
59
0
Order By: Relevance
“…The fine-tuned models are evaluated based on their accuracy when predicting properties for polymers unseen in training. The evaluation metrics are relative root-mean-square error (RMSE) and R 2 with a 5-fold cross-validation, which are similar to those used for Trans-Polymer 15 and polyBERT. 16 The relative RMSE is defined as the RMSE scaled by the difference between the maximum and minimum values of each property.…”
Section: ■ Core Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The fine-tuned models are evaluated based on their accuracy when predicting properties for polymers unseen in training. The evaluation metrics are relative root-mean-square error (RMSE) and R 2 with a 5-fold cross-validation, which are similar to those used for Trans-Polymer 15 and polyBERT. 16 The relative RMSE is defined as the RMSE scaled by the difference between the maximum and minimum values of each property.…”
Section: ■ Core Resultsmentioning
confidence: 99%
“…In similar fashion to Kuenneth et al, 16,34 we employ a multitasking approach which predicts all properties simultaneously for better data efficiency. Our results show that the models pretrained on Enamine REAL achieve similar accuracy to the TransPolymer and polyBERT work, thereby suggesting that a model pretrained on a large-scale, accessible, and general molecular 4, 15 which has neither Eat nor STD values, and polyBERT is reproduced from Kuenneth and Ramprasad's Figure 5 and Table S1. 16 SML-MT, pretrained on small molecules, achieves similar accuracy to TransPolymer and polyBERT, pretrained on augmented polymer data sets.…”
Section: ■ Introductionmentioning
confidence: 87%
“…These graphs capture the composition and architecture of polydisperse polymers, and have improved prediction of polymer properties compared to molecular descriptors alone. , For example, they have been successful in predicting catalysis conditions for ring-opening polymerizations across multiple different data sets . TransPolymer, developed from the Transformer-based language processing algorithm, has been successfully trained on diverse polymer data sets for different bulk properties . Additionally, graph representation has been successful in theoretical studies, such as predicting the radius of gyration ( R g ) of coarse-grained model polymers with defined sequence and composition …”
Section: Library Synthesis Methodsmentioning
confidence: 99%
“…Other types of dimensionality reduction tools include t-distributed stochastic neighbor embedding (t-SNE), designed to visualize high-dimensional data using a mapping onto two- or three-dimensions, and uniform manifold approximation and projection (UMAP), a topology driven technique . For example, the TransPolymer model uses t-SNE to visualize millions of unlabeled training data points and data from specific property databases . Many other types of dimension reduction techniques are detailed thoroughly by Banerjee and co-workers …”
Section: High-throughput Characterization and Analysis Of Polymer Lib...mentioning
confidence: 99%
See 1 more Smart Citation