Proceedings of the 18th ACM Conference on Information and Knowledge Management 2009
DOI: 10.1145/1645953.1646256
|View full text |Cite
|
Sign up to set email alerts
|

Maximal metric margin partitioning for similarity search indexes

Abstract: We propose a partitioning scheme for similarity search indexes that is called Maximal Metric Margin Partitioning (MMMP). MMMP divides the data on the basis of its distribution pattern, especially for the boundaries of clusters. A partitioning surface created by MMMP is likely to be at maximum distances from the two cluster boundaries. MMMP is the first similarity search index approach to focus on partitioning surfaces and data distribution patterns. We also present an indexing scheme, named the MMMP-Index, whi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2010
2010
2011
2011

Publication Types

Select...
2
1
1

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…We evaluated the indexing performance in comparison with existing schemes. While our previous paper [6] showed the partitioning performance of MMMP, this paper extends the experimental evaluation and assesses the index Copyright c 2010 The Institute of Electronics, Information and Communication Engineers performance.…”
Section: Introductionmentioning
confidence: 89%
“…We evaluated the indexing performance in comparison with existing schemes. While our previous paper [6] showed the partitioning performance of MMMP, this paper extends the experimental evaluation and assesses the index Copyright c 2010 The Institute of Electronics, Information and Communication Engineers performance.…”
Section: Introductionmentioning
confidence: 89%
“…All existing index-based studies focus on improving the index. While general metric indexes are designed to deal with any distances in the range queries [10,18,11], the indexes for similarity join queries assume fixed-distance range queries. eD-index [8] constructs an index that is an extension of D-index [7].…”
Section: Definition 3 (Modified Ball Partitioning) For a Metric Spacementioning
confidence: 99%
“…OMNI-Family [15] chooses a set of pivots based on the minimum bounding region. MMMP [17] has a pivot selection based on the maximal margin, and it classifies dense regions. Moreover, some methods combine different pivot selection techniques [19], [27].…”
Section: Related Workmentioning
confidence: 99%
“…However, they don't consider the partitioning boundary of the pivot and the selected pivot may not be good for pruning. The clustering-based approach [13], [17] aims at selecting a pivot and its partitioning boundary based on the clusters in the dataset. Although it can select better pivots for pruning than the simple statistics-based approach, it doesn't take into consideration the tree balancing.…”
Section: Related Workmentioning
confidence: 99%