The Linked Open Data (LOD) includes over 31 billion Resource Description Framework (RDF) triples interlinked by around 504 million SameAs links (as of September 2011). The data sets of the LOD use different ontologies to describe instances, that cause the ontology heterogeneity problem. Dealing with the heterogeneous ontologies is a challenging problem and it is time-consuming to manually learn big ontologies in the LOD. The heterogeneity of ontologies in the LOD can be reduced by automatically integrating related ontology classes and properties, which can be retrieved from interlinked instances. The interlinked instances can be represented as an undirected graph, from which we can discover the characteristics of instances and retrieve related ontology classes and properties that are important for linking instances. In this paper, we retrieve graph patterns from several linked data sets and perform ontology alignment methods on each graph pattern to identify related ontology classes and properties from the data sets. We successfully integrate various ontologies, analyze the characteristics of interlinked instances, and detect mistaken properties in the real data sets. Furthermore, our approach solves the ontology heterogeneity problem and helps Semantic Web application developers easily query on various data sets with the integrated ontology.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.