2022
DOI: 10.1038/s43246-022-00283-x
|View full text |Cite
|
Sign up to set email alerts
|

Why big data and compute are not necessarily the path to big materials science

Abstract: Applied machine learning has rapidly spread throughout the physical sciences. In fact, machine learning-based data analysis and experimental decision-making have become commonplace. Here, we reflect on the ongoing shift in the conversation from proving that machine learning can be used, to how to effectively implement it for advancing materials science. In particular, we advocate a shift from a big data and large-scale computations mentality to a model-oriented approach that prioritizes the use of machine lear… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
25
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 29 publications
(26 citation statements)
references
References 65 publications
1
25
0
Order By: Relevance
“…A number of the leading scientists now argue that artificial intelligence (AI) needs to move away from ever-increasing dataset and network sizes, to essentially 'do more with less [58] by using small data sets and physical priors. Similar arguments have been made in the physical science field [59,60], typically in cognizance of rich systems of prior knowledge and inferential biases and extremely small experimental budgets. This convergence between ML and physical fields is, in our opinion, a significant aspect of the last three years.…”
supporting
confidence: 53%
“…A number of the leading scientists now argue that artificial intelligence (AI) needs to move away from ever-increasing dataset and network sizes, to essentially 'do more with less [58] by using small data sets and physical priors. Similar arguments have been made in the physical science field [59,60], typically in cognizance of rich systems of prior knowledge and inferential biases and extremely small experimental budgets. This convergence between ML and physical fields is, in our opinion, a significant aspect of the last three years.…”
supporting
confidence: 53%
“…A concerted effort has been undertaken by scientists across different disciplines to address the need for ML for small datasets (Elbadawi et al, 2021a). There is undoubtedly a fallacy surrounding the idea that large data is needed for feasible predictions (Fujinuma et al, 2022). An effective model can indeed be built with only a few data instances (Baskin, 2019).…”
Section: Discussionmentioning
confidence: 99%
“…Limitations of data-driven strategies have been noted in the literature, with the need for more data or higher-quality data being stressed . Algorithmic performance can also depend on the initial data set (the “cold start” problem), and available data sets often exhibit sampling biases .…”
Section: The State Of Current Machine Learning Approachesmentioning
confidence: 99%
“…Classically, this was done by devising physical models in terms of the relevant variables and the admissible functional forms of their interactions. Physics-based computer simulations serve a similar role, although the examples above indicate their limits for exceptional materials. , We focus purely on data-driven approaches. Strategies of physics-informed machine learning are one approach for this problem.…”
Section: Recommendations Toward ML For Exceptional Materialsmentioning
confidence: 99%