2014
DOI: 10.1007/978-3-319-07443-6_16
|View full text |Cite
|
Sign up to set email alerts
|

NLP Data Cleansing Based on Linguistic Ontology Constraints

Abstract: Abstract. Linked Data comprises of an unprecedented volume of structured data on the Web and is adopted from an increasing number of domains. However, the varying quality of published data forms a barrier for further adoption, especially for Linked Data consumers. In this paper, we extend a previously developed methodology of Linked Data quality assessment, which is inspired by test-driven software development. Specifically, we enrich it with ontological support and different levels of result reporting and des… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 12 publications
(15 citation statements)
references
References 11 publications
0
15
0
Order By: Relevance
“…The two main components of our solution are: the rml (Section 3.2) that uses mapping definitions expressed in rdf, a prerequisite for uniform quality assessment and automated refinements, as we discussed above, and the rdfunit validation framework (Section 3.2) due to its associated test-case-based architecture [20]. A proof-of-concept implementation relies on the rmlvalidator which can be found at https://github.com/RMLio/RML-Validator.git.…”
Section: Quality Assessment and Refinement With [R2]rml And Rdfunitmentioning
confidence: 99%
See 1 more Smart Citation
“…The two main components of our solution are: the rml (Section 3.2) that uses mapping definitions expressed in rdf, a prerequisite for uniform quality assessment and automated refinements, as we discussed above, and the rdfunit validation framework (Section 3.2) due to its associated test-case-based architecture [20]. A proof-of-concept implementation relies on the rmlvalidator which can be found at https://github.com/RMLio/RML-Validator.git.…”
Section: Quality Assessment and Refinement With [R2]rml And Rdfunitmentioning
confidence: 99%
“…The rdfunit ontology provides multiple result representations in different formats [20],including rdfbased serialisations (rut:ExtendedTestCaseResult result type). Therefore, its results are easily processed by an agent that can automatically add and delete triples or suggest actions to the data publisher.…”
Section: [R2]rml Refinements Based On Quality Assessmentmentioning
confidence: 99%
“…Dimitris et. al., [14] uses an NLP based approach for data cleaning where it uses Linked Data for cleaning data. In 2005, Xin et.…”
Section: Related Workmentioning
confidence: 99%
“…RDFUnit [10][11][12] 17 is a framework for test-driven Linked Data quality assessment, which is inspired by test-driven software development. A key principle of test-driven software development is to start the development with the implementation of automated test-methods before the actual functionality is implemented.…”
Section: Linked Data Quality Assessment With Rdfunitmentioning
confidence: 99%
“…Since different NER tools classify the entities with classes from different classification systems (classification ontologies), we perform alignment of those ontologies to the DBpedia Ontology 23 . In the future, we hope to exploit the availability of interoperable NIF corpora as described in [10].…”
Section: Benchmarking Semantic Named Entity Recognition Systemsmentioning
confidence: 99%