2022
DOI: 10.3389/fdata.2022.850611

A Survey of Data Quality Measurement and Monitoring Tools

Abstract: High-quality data is key to interpretable and trustworthy data analytics and the basis for meaningful data-driven decisions. In practical scenarios, data quality is typically associated with data preprocessing, profiling, and cleansing for subsequent tasks like data integration or data analytics. However, from a scientific perspective, a lot of research has been published about the measurement (i.e., the detection) of data quality issues and different generally applicable data quality dimensions and metrics ha…

Citations: cited by 77 publications (46 citation statements)
References: 50 publications
“…Analyzing and steering Data Quality in analytical processes are very relevant activities in computer science and data analysis [e.g., Ahmed et al (2018) for improvement of data quality in intelligent e-CRM applications, or Vielberth et al (2021) for security incidents], where many automatic tools exist that support these tasks, be it to different degrees. Recent work by Ehrlinger et al (2019) surveyed 667 software tools dedicated to data quality. Among other considerations, the authors report on the large heterogeneity of the tools, evidencing several limitations: (1) more than half of them work only with proprietary solutions; (2) most of them lack implementation of important Data Quality dimensions identified in the state-of-the-art literature, (3) most of them do not support comparability due to the way in which metrics are defined, and (4) they lack user interaction and exploration capabilities supported by visualization for data quality analysis (most of them only focus on usability of basic GUIs useful just to conduct the analysis and not to explore results or support hypothesis forming and testing).…”
Section: Related Work (mentioning)
confidence: 99%
“…This work has been motivated by the identification of the need to monitor quality metrics in time series to ensure the high DQ over time through the possibility of correcting the problems identified [37]. In addition, the available tools do not finish solving the implementation of quality metrics.…”
Section: Related Work (mentioning)
confidence: 99%
“…Tool dedicated to data preparation and data quality. The AP tool provides an integrated data management platform that, in addition to features aimed at data preparation, also provides data cleansing, statistical analysis, pattern matching, and data profiling [Ehrlinger and Wöß 2022].…”
Section: Selected Data Quality Tools (unclassified)
“…Tool aimed at big data, dedicated to continuously measuring data quality in batch or streaming mode. The AG tool offers a set of well-defined data quality domain models that cover different data quality problems in general [Ehrlinger and Wöß 2022].…”
Section: Selected Data Quality Tools (unclassified)