2021
DOI: 10.1021/acscatal.0c04525

Open Catalyst 2020 (OC20) Dataset and Community Challenges

Abstract: Catalyst discovery and optimization is key to solving many societal and energy challenges including solar fuel synthesis, long-term energy storage, and renewable fertilizer production. Despite considerable effort by the catalysis community to apply machine learning models to the computational catalyst discovery process, it remains an open challenge to build models that can generalize across both elemental compositions of surfaces and adsorbate identity/configurations, perhaps because datasets have been smaller…

Citations: cited by 381 publications (526 citation statements)
References: 93 publications
“…This is addressed in reference [ 8 ] by using the sure-independence-screening-and-sparsifying-operator approach [ 32 ]. Similarly, we note that other AI strategies have been developed and applied for the accurate estimation of adsorption energies [ 33 , 34 ]. Contrary to such global approaches, however, SGD provides a local description focused only on specific desired behaviors.…”
Section: Subgroups Of Surface Sites Deviating From the Linear-scaling... (mentioning)
confidence: 99%
“… [244] The currently available data sets include the Catalysis Hub, Open Catalyst 2020 (OC20) Data set, CMR project, etc. [244] [247] Except for the limited number of data sets, the data set’s chemical diversity also limits the generalizability of ML predictions. [248] One more thing to take care of is the inconsistency of data sets, as they may obtain from different levels of theory.…”
Section: Challenges and Opportunities (mentioning)
confidence: 99%
“…First, we demonstrate the performance of PFP architecture on the Structure to Energy and Forces (S2EF) task [20]. We used the S2EF 2M dataset as training data, which is a sub-dataset two orders of magnitude smaller than the largest dataset provided by OC20.…”
Section: Benchmarks (mentioning)
confidence: 99%
“…The Open Catalyst Project, which targets molecular adsorption in catalytic reactions, has constructed a massive surface adsorption structure dataset known as the Open Catalyst 2020 (OC20) dataset. [19,20] In this way, the area covered by NNPs has gradually expanded.…”
Section: Introduction (mentioning)
confidence: 99%