Proceedings of the 23rd International Conference on World Wide Web 2014
DOI: 10.1145/2566486.2568002
|View full text |Cite
|
Sign up to set email alerts
|

Test-driven evaluation of linked data quality

Abstract: Linked Open Data (LOD) comprises an unprecedented volume of structured data on the Web. However, these datasets are of varying quality ranging from extensively curated datasets to crowdsourced or extracted data of often relatively low quality. We present a methodology for test-driven quality assessment of Linked Data, which is inspired by test-driven software development. We argue that vocabularies, ontologies and knowledge bases should be accompanied by a number of test cases, which help to ensure a basic lev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
187
0
3

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 195 publications
(191 citation statements)
references
References 26 publications
(28 reference statements)
1
187
0
3
Order By: Relevance
“…Semantic data quality tools from the state of the art such as TCD's Dacura Quality Service for OWLbased validation (Feeney, 2017) and AKSW's RDFUnit tool for SPARQL and SHACL-based data unit testing (Kontokostas, 2014).…”
Section: Potential Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…Semantic data quality tools from the state of the art such as TCD's Dacura Quality Service for OWLbased validation (Feeney, 2017) and AKSW's RDFUnit tool for SPARQL and SHACL-based data unit testing (Kontokostas, 2014).…”
Section: Potential Approachesmentioning
confidence: 99%
“…Automated support for data stakeholder or steward governance of data quality is immature due to the lack of standardisation of integration points for big data quality control systems (ISO, 2014). In contrast, for low level data quality metrics, recent advances in semantic data quality analysis show great promise (Bertossi, 2013, Feeney, 2016, Kontokostas, 2014 but methods have not yet emerged to apply them to traditional databases and semi-structured web data, where most data growth is centred. Current work on dataset meta-data standards by the W3C (Maali, 2014) would be a natural basis for business-oriented quality metadata.…”
Section: Introductionmentioning
confidence: 99%
“…DBpedia content is automatically generated from Wikipedia with limited supervision and is prone to errors. According to the research conducted by Kontokostas et al (2014) [20], 28K resources in the English language version of DBpedia share the same coordinates with another resource. This type of location error often results from the relatively low accuracy, with which coordinates are expressed and the degree of location generalization.…”
Section: Improving Geospatial Linked Data Trustworthiness and Qualitymentioning
confidence: 99%
“…We distinguish two types of issues with dbpedia: schema-level rules that define how to apply vocabularies to raw data [6,10,14], e.g., the dbo:militaryBranch property is used for entities of dbo:MilitaryUnit type, but it should only be used with entities of dbo:Person type [13]. data-level extracted, or processed and transformed data values.…”
Section: Introductionmentioning
confidence: 99%