--Data quality and accuracy affects the success of data integration in Linked Open Data (LOD). The main goal of data fusionis to represent each real-world entity once on the Web. Data inaccuracy problems exist due to misspelling and a wide range of typographical differences mainly in non-Latin languages, those problems become more complicated when a person is identified by a name, and this name can be presented differently in same/different languages. Up to author's knowledge, the previous approaches which supported Arabic person names are not designed to work with LOD. This paper proposes a framework that uses person names as matching criteria from cross-language LOD Datasets. The proposed framework has substantial improvements in matching results compared to state of the art framework of matching techniques with better matching rate which exceed 6% in precision and 6% in recall.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.