By combining metal nodes with organic
linkers we can potentially
synthesize millions of possible metal–organic frameworks (MOFs).
The fact that we have so many materials opens many exciting avenues
but also create new challenges. We simply have too many materials
to be processed using conventional, brute force, methods. In this
review, we show that having so many materials allows us to use big-data
methods as a powerful technique to study these materials and to discover
complex correlations. The first part of the review gives an introduction
to the principles of big-data science. We show how to select appropriate
training sets, survey approaches that are used to represent these
materials in feature space, and review different learning architectures,
as well as evaluation and interpretation strategies. In the second
part, we review how the different approaches of machine learning have
been applied to porous materials. In particular, we discuss applications
in the field of gas storage and separation, the stability of these
materials, their electronic properties, and their synthesis. Given
the increasing interest of the scientific community in machine learning,
we expect this list to rapidly expand in the coming years.