ontology matching (OM) is a critical process for many disciplines. It aims at identifying the semantic correspondences among different ontologies that are merged for data integration. Unfortunately, OM still faces challenges, especially, in the big data integration (BDI) area. The high degree of semantic heterogeneity problem prevents the integration of relevant data and increased with large-scale ontologies of BDI. The quality of OM still needs more improvements to cope with BDI applications. So, this paper proposes a semantic OM approach called semantic matcher. It achieves the goals of semantic heterogeneity resolving and the quality improvement. It exploits the semantic similarity based on the word embedding model. The word embedding model provides efficient distribution semantic representation of domain words as vectors based on their context. The applicability of the proposed semantic matcher is evaluated through an experiment. The conference and anatomy gold standard datasets are evaluating the experimental results. Accuracy evaluates the quality through the precision, recall, and F-measure measures. Based on the experimental results evaluation, the proposed semantic matcher is promising and efficient in the semantic OM for BDI.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.