2015
DOI: 10.1145/2845644
|View full text |Cite
|
Sign up to set email alerts
|

Answering enumeration queries with the crowd

Abstract: Hybrid human/computer database systems promise to greatly expand the usefulness of query processing by incorporating the crowd. Such systems raise many implementation questions. Perhaps the most fundamental issue is that the closed-world assumption underlying relational query semantics does not hold in such systems. As a consequence the meaning of even simple queries can be called into question. Furthermore, query progress monitoring becomes difficult due to nonuniformities in the arrival of crowdsourced data … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…Thus, this process incurs the monetary cost and causes time latency. Thus, the main reason for using the monetary cost and time latency metrics in our research work to estimate the missing values of skylines is due to the fact that these metrics are the most critical factors when dealing with crowdsourced databases [4], [5], [8], [10], [41], [45], [46], [47], [48]. These six metrics are measured by varying the dataset size, missing rate, the user-given threshold value for the acceptable relative error rate between the real missing and the estimated values of the skylines, and the number of the batch for crowd-sourced estimation.…”
Section: Experiments Evaluationmentioning
confidence: 99%
“…Thus, this process incurs the monetary cost and causes time latency. Thus, the main reason for using the monetary cost and time latency metrics in our research work to estimate the missing values of skylines is due to the fact that these metrics are the most critical factors when dealing with crowdsourced databases [4], [5], [8], [10], [41], [45], [46], [47], [48]. These six metrics are measured by varying the dataset size, missing rate, the user-given threshold value for the acceptable relative error rate between the real missing and the estimated values of the skylines, and the number of the batch for crowd-sourced estimation.…”
Section: Experiments Evaluationmentioning
confidence: 99%
“…There are some works that apply crowd-sourcing techniques [TKFS13,TKFS16] and develop statistical tools to find a trade-off between cost/time and completeness of results. However, while the query result is constructed incrementally (the query is performed on an initial small dataset and the result is refined as more source data arrives to the system), in our proposal an initial large dataset is approximated at once.…”
Section: Related Workmentioning
confidence: 99%
“…We therefore started to develop techniques that estimate not only the amount of missing data based on techniques from [100,101,102] but also the impact those items might have on query results [19,20]. We assume a simple data integration scenario in which (semi-)independent data sources are integrated into a single database.…”
Section: Uncertainty As Unknown Unknownsmentioning
confidence: 99%