Accelerating the prediction of CO2 capture at low partial pressures in metal-organic frameworks using new machine learning descriptors

Orhan, Ibrahim B.; Le, Tu C.; Babarao, Ravichandar; Thornton, Aaron W.

doi:10.1038/s42004-023-01009-x

Cited by 13 publications

(5 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…95 Another new descriptor, effective point charge, was recently introduced and used together with the Henry coefficients of CO 2 in ML models to predict CO 2 capture properties of MOFs at very low-pressure conditions mimicking direct air capture. 96 Development and usage of new features in the future will lead to much accurate ML models.…”

Section: Discussionmentioning

confidence: 99%

“…For example, energy-based descriptors, including Gibbs free energy and Boltzmann weighted energy distributions of xenon (Xe) and krypton (Kr) gases, were demonstrated to be more important for determining Xe/Kr selectivities of MOFs compared to their structural and chemical features . Another new descriptor, effective point charge, was recently introduced and used together with the Henry coefficients of CO 2 in ML models to predict CO 2 capture properties of MOFs at very low-pressure conditions mimicking direct air capture . Development and usage of new features in the future will lead to much accurate ML models.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Accelerated Discovery of Metal–Organic Frameworks for CO₂ Capture by Artificial Intelligence

Gulbalkan,

Aksu,

Ercakir

et al. 2023

Ind. Eng. Chem. Res.

View full text Add to dashboard Cite

The existence of a very large number of porous materials is a great opportunity to develop innovative technologies for carbon dioxide (CO 2 ) capture to address the climate change problem. On the other hand, identifying the most promising adsorbent and membrane candidates using iterative experimental testing and brute-force computer simulations is very challenging due to the enormous number and variety of porous materials. Artificial intelligence (AI) has recently been integrated into molecular modeling of porous materials, specifically metal−organic frameworks (MOFs), to accelerate the design and discovery of high-performing adsorbents and membranes for CO 2 adsorption and separation. In this perspective, we highlight the pioneering works in which AI, molecular simulations, and experiments have been combined to produce exceptional MOFs and MOF-based composites that outperform traditional porous materials in CO 2 capture. We outline the future directions by discussing the current opportunities and challenges in the field of harnessing experiments, theory, and AI for accelerated discovery of porous materials for CO 2 capture.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Accelerated Discovery of Metal–Organic Frameworks for CO₂ Capture by Artificial Intelligence

Gulbalkan,

Aksu,

Ercakir

et al. 2023

Ind. Eng. Chem. Res.

View full text Add to dashboard Cite

show abstract

“…To assist in this endeavor, computational techniques such as molecular simulations and density-functional theory − were used to screen large MOF data sets. Alternatively, machine learning (ML) approaches were exploited to further accelerate MOF discovery. − Based on a training sample, a descriptor-based ML model is learned, for e.g., kernel ridge regression, random forests, or gradient boosting regression trees, ,− to predict electronic and gas adsorption properties of unseen samples. Recently, deep learning methods such as crystal graph convolutional neural networks (CGCNNs , ) and transformer-based models ,, were also investigated.…”

Section: Introductionmentioning

confidence: 99%

Informative Training Data for Efficient Property Prediction in Metal–Organic Frameworks by Active Learning

Jose,

Devijver,

Jakse

et al. 2024

J. Am. Chem. Soc.

View full text Add to dashboard Cite

In recent data-driven approaches to material discovery, scenarios where target quantities are expensive to compute and measure are often overlooked. In such cases, it becomes imperative to construct a training set that includes the most diverse, representative, and informative samples. Here, a novel regression tree-based active learning algorithm is employed for such a purpose. It is applied to predict the band gap and adsorption properties of metal−organic frameworks (MOFs), a novel class of materials that results from the virtually infinite combinations of their building units. Simpler and low dimensional descriptors, such as those based on stoichiometric and geometric properties, are used to compute the feature space for this model owing to their ability to better represent MOFs in the low data regime. The partitions given by a regression tree constructed on the labeled part of the data set are used to select new samples to be added to the training set, thereby limiting its size while maximizing the prediction quality. Tests on the QMOF, hMOF, and dMOF data sets reveal that our method constructs small training data sets to learn regression models that predict the target properties more efficiently than existing active learning approaches, and with lower variance. Specifically, our active learning approach is highly beneficial when labels are unevenly distributed in the descriptor space and when the label distribution is imbalanced, which is often the case for real world data. The regions defined by the tree help in revealing patterns in the data, thereby offering a unique tool to efficiently analyze complex structure−property relationships in materials and accelerate materials discovery.

show abstract

“…Additionally, Hou et al (2022) developed a deep learning model based on Bidirectional long short-term memory with Channel and Spatial Attention network (BCSA) using the molecular Simplified Molecular Input Line Entry System (SMILES) to predict aqueous solubility [14]. Furthermore, machine learning aids in characterizing the absorption and adsorption kinetics of CO 2 in ionic liquids and metal-organic frameworks by predicting its HLC using Random Forest (RF), Multiple Linear Regression (MLP), and Support Vector Machine (SVM) [15][16][17]. Wang et al (2020) used an Adaptive Neuro-Fuzzy Inference System (ANFIS) and a Least Squares Support Vector Machine (LSSVM) to predict HLC in water based on the molecular structure of compounds.…”

Section: Introductionmentioning

confidence: 99%

Machine Learning Approach for the Estimation of Henry’s Law Constant Based on Molecular Descriptors

Ullah,

Shaheryar,

Lim

2024

Atmosphere

View full text Add to dashboard Cite

In atmospheric chemistry, the Henry’s law constant (HLC) is crucial for understanding the distribution of organic compounds across gas, particle, and aqueous phases. Quantitative structure–property relationship (QSPR) models described in scientific research are generally tailored to specific groups or categories of substances and are often developed using a limited set of experimental data. This study developed a machine learning model using an extensive dataset of experimental HLCs for approximately 1100 organic compounds. Molecular descriptors calculated using alvaDesc software (v 2.0) were used to train the models. A hybrid approach was adopted for feature selection, ensuring alignment with the domain knowledge. Based on the root mean squared error (RMSE) of the training and test data after cross-validation, Gradient Boosting (GB) was selected as a model for predicting HLC. The hyperparameters of the selected model were optimized using the automated hyperparameter optimization framework Optuna. The impact of features on the target variable was assessed using the SHapley Additive exPlanations (SHAP). The optimized model demonstrated strong performance across the training, evaluation, and test datasets, achieving coefficients of determination (R2) of 0.96, 0.78, and 0.74, respectively. The developed model was used to estimate the HLC of compounds associated with carbon capture and storage (CCS) emissions and secondary organic aerosols.

show abstract

Accelerating the prediction of CO2 capture at low partial pressures in metal-organic frameworks using new machine learning descriptors

Cited by 13 publications

References 39 publications

Accelerated Discovery of Metal–Organic Frameworks for CO₂ Capture by Artificial Intelligence

Accelerated Discovery of Metal–Organic Frameworks for CO₂ Capture by Artificial Intelligence

Informative Training Data for Efficient Property Prediction in Metal–Organic Frameworks by Active Learning

Machine Learning Approach for the Estimation of Henry’s Law Constant Based on Molecular Descriptors

Contact Info

Product

Resources

About

Accelerating the prediction of CO2 capture at low partial pressures in metal-organic frameworks using new machine learning descriptors

Cited by 13 publications

References 39 publications

Accelerated Discovery of Metal–Organic Frameworks for CO2 Capture by Artificial Intelligence

Accelerated Discovery of Metal–Organic Frameworks for CO2 Capture by Artificial Intelligence

Informative Training Data for Efficient Property Prediction in Metal–Organic Frameworks by Active Learning

Machine Learning Approach for the Estimation of Henry’s Law Constant Based on Molecular Descriptors

Contact Info

Product

Resources

About

Accelerated Discovery of Metal–Organic Frameworks for CO₂ Capture by Artificial Intelligence

Accelerated Discovery of Metal–Organic Frameworks for CO₂ Capture by Artificial Intelligence