2005
DOI: 10.1007/11431855_14
|View full text |Cite
|
Sign up to set email alerts
|

Estimating Recall and Precision for Vague Queries in Databases

Abstract: Abstract.In vague queries, a user enters a value that represents some real world object and expects as the result the set of database values that represent this real world object even with not exact matching. The problem appears in databases that collect data from different sources or databases were different users enter data directly. Query engines usually rely on the use of some type of similarity metric to support data with inexact matching. The problem of building query engines to execute vague queries has… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
17
0
1

Year Published

2006
2006
2016
2016

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 13 publications
(20 citation statements)
references
References 31 publications
2
17
0
1
Order By: Relevance
“…Further, the experiments demonstrate how the proposed automatic approach leads to results that are close to those obtained by the approach that we developed previously [33], which requires human intervention.…”
Section: Introductionsupporting
confidence: 74%
See 1 more Smart Citation
“…Further, the experiments demonstrate how the proposed automatic approach leads to results that are close to those obtained by the approach that we developed previously [33], which requires human intervention.…”
Section: Introductionsupporting
confidence: 74%
“…In previous work [33], we proposed a procedure for reducing human intervention. In that procedure, instead of generating the clusters manually (Step 2 of the procedure above), the human expert just informs how many distinct real world objects are represented by the instances in the samples taken from the dataset.…”
Section: Introductionmentioning
confidence: 99%
“…However, it is worth pointing out that this sampling process could be automated through the use of clustering algorithms, as done in our previous work (Stasiu, Heuser, & da Silva, 2005). In this case, all the elements of a given cluster are considered as representing the same real world object.…”
Section: Discussionmentioning
confidence: 99%
“…Based on collection samples, a semi-automatic approach for the estimation of recall and precision values for various similarity thresholds minimizes efforts involved by static similarity threshold definitions [28,29]. It requires expert input only where the number of distinct objects contained in each sample is concerned and uses two techniques to reduce human interaction, namely (i) sample use and (ii) similarity cluster process.…”
Section: Related Workmentioning
confidence: 99%
“…A new approach [5] combines two strategies to eliminate human intervention [28,29], during the recall and precision values estimation process. They are (i) use of agglomerative hierarchical clustering algorithms and (ii) use of the silhouette coefficient for cluster evaluation.…”
Section: Related Workmentioning
confidence: 99%