2023
DOI: 10.1021/acs.chemmater.2c02425
|View full text |Cite
|
Sign up to set email alerts
|

Revealing Hidden Patterns through Chemical Intuition and Interpretable Machine Learning: A Case Study of Binary Rare-Earth Intermetallics RX

Abstract: Machine learning algorithms have been applied successfully in many areas of materials chemistry but often suffer from an inability to extract chemical insight. To demonstrate that an approach combining machine learning and chemical intuition can be effective in generating interpretable models, the structures of binary equiatomic rare-earth intermetallics RX, whose relationships have long defied understanding, were investigated as a case study. A structure map was developed based on only two parameters, which a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 76 publications
0
5
0
Order By: Relevance
“…We note that although the primary features are physically motivated, the models generated by SISSO by combining primary features do not necessarily have a physical interpretation. An important advantage of SISSO is that it can work well with a relatively small amount of data. This is critical in this case because of the high computational cost of acquiring GW+BSE training data for molecular crystals. To train SISSO, we generated a data set containing GW+BSE SF driving force values of 101 PAH molecular crystals with up to ∼500 atoms in the unit cell (the PAH101 set).…”
Section: Resultsmentioning
confidence: 99%
“…We note that although the primary features are physically motivated, the models generated by SISSO by combining primary features do not necessarily have a physical interpretation. An important advantage of SISSO is that it can work well with a relatively small amount of data. This is critical in this case because of the high computational cost of acquiring GW+BSE training data for molecular crystals. To train SISSO, we generated a data set containing GW+BSE SF driving force values of 101 PAH molecular crystals with up to ∼500 atoms in the unit cell (the PAH101 set).…”
Section: Resultsmentioning
confidence: 99%
“…In total, there were 23 features: three to express the composition ( x , y , and z ), 10 for the elemental properties of M, and 10 for the elemental properties of X. These features are similar to those that were successfully used to tackle a related problem in the structural classification of binary rare-earth intermetallics . It has been hypothesized that many properties of an element can be derived merely from its position in the periodic table .…”
Section: Resultsmentioning
confidence: 99%
“…These compounds were examined because the distribution of the three structure types adopted (44 FeB, 33 TlI, and 29 CsCl) appears to be puzzlingly haphazard. Similar to how structure maps for other classes of compounds were historically developed through a combination of trial-and-error and chemical intuition, the elements R and X were rearranged in different sequences until a reasonable separation of these structure types was achieved . A good descriptor for the x axis of the RX structure map was found to be a simple combination of R-element features, x = Z r m , where Z is the atomic number and r m is the metallic radius.…”
Section: Resultsmentioning
confidence: 99%
“…For the main test cases, to compare the two approaches (DO and DT) for descriptor scoring, three data sets were examined. Crystal structure data were gathered for ABX 3 compounds (perovskites vs nonperovskites) from previous literature, for oxides AB 2 O 4 (spinels vs nonspinels) from Pearson’s Crystal Data , and for rare-earth intermetallics RX (CsCl, TlI, and FeB structure types) from our own recent work . The number of samples in these data sets, a schematic of the workflow, and the classification accuracies for the test sets are summarized in Figure .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation