2013
DOI: 10.1007/978-3-642-38562-9_7
|View full text |Cite
|
Sign up to set email alerts
|

Data Fusion: Resolving Conflicts from Multiple Sources

Abstract: Abstract. Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical to resolve conflicts and discover values that reflect the real world; this task is called data fusion. This paper describes a novel approach that finds tr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 27 publications
(27 citation statements)
references
References 12 publications
0
27
0
Order By: Relevance
“…As an important aspect of this area, research on resolving conflicts from multiple sources [4,5,10,22] arise various ways to handle conflicts in data integration. A common method is to conduct voting or averaging-for categorical data, the information with the highest number of occurrences is regarded as truth; for continuous claims, the mean is taken as the true value.…”
Section: Related Workmentioning
confidence: 99%
“…As an important aspect of this area, research on resolving conflicts from multiple sources [4,5,10,22] arise various ways to handle conflicts in data integration. A common method is to conduct voting or averaging-for categorical data, the information with the highest number of occurrences is regarded as truth; for continuous claims, the mean is taken as the true value.…”
Section: Related Workmentioning
confidence: 99%
“…Resolving conflicts from multiple sources have been studied in the database community for years [4,5,10,13] resulting in multiple ways to handle conflicts in data integration. Among them, one commonly used approach to eliminate conflicts for categorical data is to conduct majority voting so that information with the highest number of occurrences is regarded as the correct answer; and for continuous values, we can simply take the mean or median as the answer.…”
Section: Source Reliabilitymentioning
confidence: 99%
“…Given a set of data items claimed by multiple sources, the truth finding (a.k.a. truth discovery) problem is to determine the true values of each claimed item, with various usages in information corroboration [12], and data fusion [8]. Similar to our crowdsourcing setting, existing work on truth finding also models the mutual reinforcing relationship between sources and data items, e.g., by a Bayesian model [52], maximum likelihood estimation [47], and latent credibility analysis [33].…”
Section: Related Workmentioning
confidence: 99%