“…The main advantage of ML is that it allows the development of a surrogate model for composition–property relationships from data, which, in turn, can be used for screening materials . Since ML approaches are data-hungry, most works rely on data obtained from the literature or produced synthetically using computer simulations. ,,,− However, glass properties are highly sensitive to the preparation and testing conditions. , Thus, the raw materials, experimental conditions, and glass preparation techniques of all the glasses must be consistent for developing accurate ML models. If available, some of these properties can be extracted from the literature employing approaches such as natural language processing. ,,, While this issue could be alleviated to a great extent by using synthetic data generated using molecular simulations, these methods have inherent limitationssuch as high cooling rates and small system sizeswhich makes the extrapolation to new glass compositions questionable.…”