1995
DOI: 10.1007/3-540-60584-3_27
|View full text |Cite
|
Sign up to set email alerts
|

Estimating data accuracy in a federated database environment

Abstract: Abstract:The need for integration of data in a heterogeneous or federated database environment creates a corresponding need for estimating the accuracy of the integrated data as a function of the accuracy of the originating data sources. Even in a single database system, different base relations are frequently characterized by dissimilar levels of accuracy; however, no technique exists for defining the accuracy of this single database system in terms of the accuracy of the base relations. This need is further … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

1997
1997
2011
2011

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(6 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…First, users are often provided information in tabular form. Second, the more detailed the granularity, the more expensive it is to measure and represent the quality metrics (Reddy and Wang 1995). Our methodology, however, is general, and we show how it can be applied in situations where quality metrics are available at the attribute level.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…First, users are often provided information in tabular form. Second, the more detailed the granularity, the more expensive it is to measure and represent the quality metrics (Reddy and Wang 1995). Our methodology, however, is general, and we show how it can be applied in situations where quality metrics are available at the attribute level.…”
Section: Introductionmentioning
confidence: 99%
“…They do not, however, provide a methodology for deriving quality metrics for the output. Reddy and Wang (1995) provide an analysis of the error propagation process when only inaccuracies and mismembers are present. Incompleteness is a critical data quality attribute, in particular for data warehousing applications that draw upon multiple internal and external data sources.…”
Section: Introductionmentioning
confidence: 99%
“…It is easy to verify that soundness and completeness satisfy all the requirements of a goodness measure. † Soundness and completeness are similar to precision and recall in information retrieval [Salton, McGill, 1983].…”
Section: Soundness and Completeness As Measures Of Data Qualitymentioning
confidence: 74%
“…The concept and importance of quality of data has been discussed many times in the literature (Ballou and Pazer, 1985;Batini and Scannapieco, 2006;Tupek, 2006;Wang and Strong, 1996) usually in context of the single data source. However, some research has been also done in the context of integrated data emphasizing the importance of data quality assurance in this context (Gertz and Schmitt, 1998;Naumann, 2002;Reddy and Wang, 1995). Data quality has been also considered in the context of data mining (Berti-Equille, 2007;Dasu and Johnson, 2003).…”
Section: Introductionmentioning
confidence: 99%