“…To solve the heterogeneity problem, we partially follow the work of [3], where the author presented a framework that automatically identifies approximate foreign-key joins in the multiple heterogeneous databases. Moreover, our system performs better in finding the most useful joins across the data sources, thanks to the regression model used in predicting the link usefulness [4,5,6]. To perform the classification task, we use the decision tree classification algorithm that exploits the joins discovered automatically across the databases [4,5,6].…”