2023
DOI: 10.1021/acscatal.3c01914
|View full text |Cite
|
Sign up to set email alerts
|

Unlocking New Insights for Electrocatalyst Design: A Unique Data Science Workflow Leveraging Internet-Sourced Big Data

Rui Ding,
Xuebin Wang,
Aidong Tan
et al.

Abstract: In the past few decades, numerous electrocatalyst design studies have been reported. Although machine learning (ML) has recently emerged as a more efficient alternative to traditional trial-and-error methods, the cost of preparing training data remains high. Inspired by the success of models like ChatGPT, which learns from a vast corpus of text data collected from the internet, we developed a data science workflow initiated by collecting datasets via a highly automated web crawler. We trained artificial neural… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 72 publications
0
3
0
Order By: Relevance
“…[118] And machine learning can also be used to predict the new kind of MXenes and fastly evaluate the potential candidates for CO 2 photoreduction. [119] Combined with large language model like ChatGPT and previous firstprinciples simulations, the discovery of new MXenes-based photocatalysts with excellent ability to adsorb and activate CO 2 will be accelerated. [120] Another vital factor influencing the photocatalytic efficiency is the separation of photo-induced charge carriers.…”
Section: Conclusion and Perspectivementioning
confidence: 99%
“…[118] And machine learning can also be used to predict the new kind of MXenes and fastly evaluate the potential candidates for CO 2 photoreduction. [119] Combined with large language model like ChatGPT and previous firstprinciples simulations, the discovery of new MXenes-based photocatalysts with excellent ability to adsorb and activate CO 2 will be accelerated. [120] Another vital factor influencing the photocatalytic efficiency is the separation of photo-induced charge carriers.…”
Section: Conclusion and Perspectivementioning
confidence: 99%
“…Thus, this approach should be viewed merely as a preliminary exercise for TM X-ides. While these ML approaches are robust and have been extensively utilized to derive significant insights into the catalytic performance of various materials, , researchers must exercise caution when making claims about the relevance and impact of specific descriptors. Future investigations must prioritize the use of more reliable data sets, as the quality of data fed into ML models critically determines their output’s accuracy. Simply put, the efficacy of ML outcomes is directly proportional to the quality of the input data.…”
Section: Understanding Performance Trends Using Machine Learningmentioning
confidence: 99%
“…Despite advances in scraping and tabulating data from the literature by using Natural Language Processing (NLP) techniques [14][15][16][17] in tandem with data extraction techniques from gures, the quality of the obtained data is still limited due to incompleteness and lack of structure of published data. 18 Moreover, due to societal bias, publication of unsuccessful attempts is oen neglected, despite being essential not only for modeling but also to prevent other researchers from conducting redundant experiments. [19][20][21] By structuring research data, the application of ML techniques can benet twofold from simpler inclusion of unsuccessful experiments in a tabulated format as well as from more reliable extraction of input data.…”
Section: Introductionmentioning
confidence: 99%