“…Data ordered with IFC can be reviewed and studied with BIM model checking software tools, such as Solibri. Then, it can be suitably mined manually, with semantic and/or latent techniques ( [53,54,55,56,57,58]), or with dedicated data parsers [59], and exported into file formats as input for ML suites; examples of such formats are .arff files for the Waikato Environment for Knowledge Analysis (WEKA), or structured .csv files to be incorporated in ML libraries of the Surprise Scikit, a Python-powered scientific toolkit for recommender systems. But for this data to be translated into meaningful independent input variables, and then connected with meaningful dependent output variables as part of a ML modelling (and especially SML) addressing the research gap mentioned in the previous section (namely, the absence of BIM data utilization for the prediction of a building project's performance, and especially its delivery cost and time overheads), it needs to be incorporated in a suitable theoretical and conceptual framework.…”