2023
DOI: 10.26434/chemrxiv-2023-0nlzl
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine learning descriptors in materials chemistry: prediction and experimental validation synthesis of novel intermetallic UCd3

Abstract: Materials informatics uses data-driven approaches for the study and discovery of materials. Features or descriptors are the crucial components in generating reliable and accurate machine-learning models. While general data can be acquired through public and commercial sources, features must be tailored for a specific application. Common featurizers are suitable for generic chemical problems, but may not be ideal for solid state materials. Here, we have assembled the Oliynyk property list for feature generation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 40 publications
0
7
0
Order By: Relevance
“…The features had 63 properties of elements, including different size scales (obviously without Miracle), electronegativities, and physical properties, and DFT-generated data. The algorithms included [ 1 ] Support-vector machines (SVR), [2] Decision Tree, [3] Multi-layer Perceptron neural network (MLP-NN), [4] Cubic spline interpolation, and [ 5 ] Gaussian process regression (GPR). The root mean squared error (RMSE) values between the actual and predicted Miracle values were compared.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The features had 63 properties of elements, including different size scales (obviously without Miracle), electronegativities, and physical properties, and DFT-generated data. The algorithms included [ 1 ] Support-vector machines (SVR), [2] Decision Tree, [3] Multi-layer Perceptron neural network (MLP-NN), [4] Cubic spline interpolation, and [ 5 ] Gaussian process regression (GPR). The root mean squared error (RMSE) values between the actual and predicted Miracle values were compared.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
“…Here, we introduce the Oliynyk property list, a carefully curated Excel file containing 98 elemental features for atomic numbers from 1 to 93. This dataset has been used as the basis for featurization in various lab-tested studies [1] . If the properties had missing values, we predicted them to complete the dataset.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Besides ESF and EDF, the construction of weighted element-to-chemical property features was carried out using a separate dataset -the Oliynyk elemental descriptors (OELDs). 46 OELDs contain detailed information on the chemical and physical properties of elements in the periodic table. These weighted element-to-chemical property features are composed of two groups:…”
Section: Data Collection and Feature Generationmentioning
confidence: 99%
“…Also falling under this category are methods that leverage domain knowledge to design better data features, more commonly known as 'feature engineering'. 17,18 More data: in this branch of the data-centric approach, the attention is shied to increasing the number of data points. This is generally considered to be more signicant in view of a better-performing statistical model 19,20 and a compelling alternative to vast domain knowledge.…”
Section: Introductionmentioning
confidence: 99%