Two-dimensional (2D)
semiconductors are central to many scientific
fields. The combination of two semiconductors (heterostructure) is
a good way to lift many technological deadlocks. Although ab initio
calculations are useful to study physical properties of these composites,
their application is limited to few heterostructure samples. Herein,
we use machine learning to predict key characteristics of 2D materials
to select relevant candidates for heterostructure building. First,
a label space is created with engineered labels relating to atomic
charge and ion spatial distribution. Then, a meta-estimator is designed
to predict label values of heterostructure samples having a defined
band alignment (descriptor). To this end, independently trained k-nearest
neighbors (KNN) regression models are combined to boost the regression.
Then, swarm intelligence principles are used, along with the boosted
estimator’s results, to further refine the regression. This
new “swarm smart” algorithm is a powerful and versatile
tool to select, among experimentally existing, computationally studied,
and not yet discovered van der Waals heterostructures, the most likely
candidate materials to face the scientific challenges ahead.