2023
DOI: 10.1021/acs.jcim.2c01389
|View full text |Cite
|
Sign up to set email alerts
|

Ring Repeating Unit: An Upgraded Structure Representation of Linear Condensation Polymers for Property Prediction

Abstract: Unique structure representation of polymers plays a crucial role in developing models for polymer property prediction and polymer design by data-centric approaches. Currently, monomer and repeating unit (RU) approximations are widely used to represent polymer structures for generating feature descriptors in the modeling of quantitative structure−property relationships (QSPR). However, such conventional structure representations may not uniquely approximate heterochain polymers due to the diversity of monomer c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 61 publications
0
9
0
Order By: Relevance
“… Polymer structure representation. Unique representation of the polyester molecular structure was done by using RRU, which is developed in our previous work 43 Descriptor generation.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“… Polymer structure representation. Unique representation of the polyester molecular structure was done by using RRU, which is developed in our previous work 43 Descriptor generation.…”
Section: Methodsmentioning
confidence: 99%
“…Herein, we report a data-driven strategy to enable evaluation of the relationship between molecular structure and T g of polyesters and further guide the design of polyesters with specific T g s. The workflow is illustrated in Figure 1. First, a multiple linear regression (MLR)-based QSPR model was developed by employing ring repeating unit (RRU)based structural representation 43,44 to uniquely represent polyesters and norm descriptors for feature engineering. The predictability, robustness, and chance correlation of the model were evaluated by external validation, internal validation, and Y-randomized analysis, respectively.…”
Section: Introductionmentioning
confidence: 99%
“…There are numerous molecular representations and ML models to choose from, and the performance of each combination can be task-dependent. , We evaluated some of the most common ML (SVM, RF, XGBoost) and DL (NN, LSTM) models to study the impact of polymer representations on model performance. To understand the impact of data augmentation, we compared canonical SMILES, augmented SMILES, , OHE of polymers (where each polymer is represented by a unique binary bit in the vector, see example in Figure S1 and Table S1), one-hot encoded molecular fragments (nonaugmented and augmented), string-based molecular fragments (iteratively rearranged and iteratively rearranged recombined fragments), ECFP6 for periodic polymer graphs, , and other common representations (SELFIES, BRICS (OHE), and ECFP6 of the monomer) …”
Section: Methodsmentioning
confidence: 99%
“…One of the most promising solutions that capture the repetitive or periodic nature of a polymer is the periodic graph representation where the two terminal ends of the polymer chain are connected, forming a ring. , Although this representation is not physically accurate, the periodic polymer graph effectively captures the bonding environment and periodicity of a polymer, outperforming the monomer, dimer, and trimer graph in several unique polymer prediction tasks …”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation