2022
DOI: 10.1007/978-3-030-91738-8_11
|View full text |Cite
|
Sign up to set email alerts
|

Metadata Quality in the Era of Big Data and Unstructured Content

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4

Relationship

2
6

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…In addition, clean, accurate data can reduce the need for manual data cleaning and processing, saving time and resources. Moreover, poor-quality data can be costly for organizations [8], leading to additional data cleaning, correction, and rework costs. Organizations can avoid these costs by ensuring data quality and using their resources better.…”
Section: A the Importance Of Data Qualitymentioning
confidence: 99%
“…In addition, clean, accurate data can reduce the need for manual data cleaning and processing, saving time and resources. Moreover, poor-quality data can be costly for organizations [8], leading to additional data cleaning, correction, and rework costs. Organizations can avoid these costs by ensuring data quality and using their resources better.…”
Section: A the Importance Of Data Qualitymentioning
confidence: 99%
“…In particular, the quality of metadata has a significant impact on the reusability of data (Elouataoui et al, 2022;Kindling and Strecker, 2022). However, the requirements of high quality metadata can be guided by repositories (Trisovic et al, 2021).…”
Section: Data Repositoriesmentioning
confidence: 99%
“…This process is known as Data Deduplication. Data duplication can occur for different reasons, such as data integration, where data are gathered from multiple data sources so the same information can be recorded more than once in another format [21] [22]. Also, data duplication could be related to human errors, so the same person, for example, could provide data with slightly different information intentionally or by mistake multiple times.…”
Section: Big Data Deduplicationmentioning
confidence: 99%