MOFSimplify, machine learning models with extracted stability data of three thousand metal–organic frameworks

Nandy, Aditya; Terrones, Gianmarco; Arunachalam, N.; Duan, Chenru; Kastner, David W.; Kulik, Heather J.

doi:10.1038/s41597-022-01181-0

Cited by 70 publications

(71 citation statements)

References 81 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These firstgeneration ML models were naturally constrained in their performance, in part by the representations or ML models themselves but primarily by the amount of data that could be easily extracted from the literature. Toward that end, we have provided our ML models and extracted properties from the data sets in a user-friendly web interface, which welcomes both feedback and the incorporation of additional community data 75 . We expect our approach, including natural extensions to other properties, to accelerate the time to discovery of stable, practical MOF materials by both computational and experimental researchers.…”

Section: Discussionmentioning

confidence: 99%

“…However, expert review of the literature could always be expected to improve the quality of our data sets and models. Toward that end, we have provided our ML models and extracted properties from the data sets in a user-friendly web interface, which welcomes both feedback and the incorporation of additional community data . The web interface also supports the uploading of new MOFs not in our data set, allowing researchers to calculate both the RAC features and ML-predicted properties on these new materials.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal–Organic Frameworks

2021

Self Cite

View full text Add to dashboard Cite

Although the tailored metal active sites and porous architectures of MOFs hold great promise for engineering challenges ranging from gas separations to catalysis, a lack of understanding of how to improve their stability limits their use in practice. To overcome this limitation, we extract thousands of published reports of the key aspects of MOF stability necessary for their practical application: the ability to withstand high temperatures without degrading and the capacity to be activated by removal of solvent molecules. From nearly 4,000 manuscripts, we use natural language processing and automated image analysis to obtain over 2,000 solvent-removal stability measures and 3,000 thermal degradation temperatures. We analyze the relationships between stability properties and the chemical and geometric structures in this set to identify limits of prior heuristics derived from smaller sets of MOFs. By training predictive machine learning (ML, i.e., Gaussian process and artificial neural network) models to encode the structure-property relationships with graph-and pore-structure-based representations, we are able to make predictions of stability orders of magnitude faster than conventional physicsbased modeling or experiment. Interpretation of important features in ML models provides insights that we use to identify strategies to engineer increased stability into typically unstable 3d-containing MOFs that are frequently targeted for catalytic applications. We expect our approach to accelerate the time to discovery of stable, practical MOF materials for a wide range of applications.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal–Organic Frameworks

2021

Self Cite

View full text Add to dashboard Cite

show abstract

“…Here is where machine learning could play a disruptive role during the coming years in order to identify, screen, classify, and correlate MOFs’ potentials on the basis of geometric, chemical, topological, energetic, and performance-based descriptors [ 316 , 317 ]. In fact, machine learning is already having a deep impact on unraveling synthesis paths and engineering strategies of MOFs for gas adsorption and separation purposes [ 318 , 319 , 320 , 321 ]. As far as the investigations of MOFs for photo-oxidative and photoreductive processes expand, it is more likely that machine learning could be applied to unravel the underpinning chemical–physical features that make the MOFs feasible for this application.…”

Section: Future Perspectives Of Mofs For Chromium Photoreductionmentioning

confidence: 99%

A State-of-the-Art of Metal-Organic Frameworks for Chromium Photoreduction vs. Photocatalytic Water Remediation

et al. 2022

View full text Add to dashboard Cite

Hexavalent chromium (Cr(VI)) is a highly mobile cancerogenic and teratogenic heavy metal ion. Among the varied technologies applied today to address chromium water pollution, photocatalysis offers a rapid reduction of Cr(VI) to the less toxic Cr(III). In contrast to classic photocatalysts, Metal-Organic frameworks (MOFs) are porous semiconductors that can couple the Cr(VI) to Cr(III) photoreduction to the chromium species immobilization. In this minireview, we wish to discuss and analyze the state-of-the-art of MOFs for Cr(VI) detoxification and contextualizing it to the most recent advances and strategies of MOFs for photocatalysis purposes. The minireview has been structured in three sections: (i) a detailed discussion of the specific experimental techniques employed to characterize MOF photocatalysts, (ii) a description and identification of the key characteristics of MOFs for Cr(VI) photoreduction, and (iii) an outlook and perspective section in order to identify future trends.

show abstract

“…Although both the local descriptors (e.g. CGCNN, 19,21 chemical descriptors, 18 RACs, 22,23 and building-block embedding. 11,24,25 ) and the global features (e.g., geometric features calculated by ZEO++, 26 the histograms of energy-grids.…”

Section: Introductionmentioning

confidence: 99%

MOFTransformer: A Multi-modal Pre-training Transformer for Universal Transfer Learning in Metal-Organic Frameworks

Kang

Park

Smit

et al. 2022

Preprint

View full text Add to dashboard Cite

In this work, we introduce MOFTransformer, a multi-model Transformer encoder pre-trained with 1 million hypothetical MOFs. The multi-modal model uses an integrated atom-based graph and energy-grid embeddings to capture both the local and global features of the MOFs, respectively. By fine-tuning the pre-trained model with small datasets (from 5,000 to 20,000), our model outperforms all other machine learning models across various properties that include gas adsorption, diffusion, electronic properties, and even text mined data. Beyond its universal transfer learning capabilities, MOFTransformer generates chemical insight by analyzing feature importance from attention scores within the self-attention layers. As such, this model can serve as a bedrock platform for other MOF researchers that seek to develop new machine learning models for their work.

show abstract

MOFSimplify, machine learning models with extracted stability data of three thousand metal–organic frameworks

Cited by 70 publications

References 81 publications

Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal–Organic Frameworks

Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal–Organic Frameworks

A State-of-the-Art of Metal-Organic Frameworks for Chromium Photoreduction vs. Photocatalytic Water Remediation

MOFTransformer: A Multi-modal Pre-training Transformer for Universal Transfer Learning in Metal-Organic Frameworks

Contact Info

Product

Resources

About