2018
DOI: 10.48550/arxiv.1809.04067
Preprint

Zoom: SSD-based Vector Search for Optimizing Accuracy, Latency and Memory

Minjia Zhang, Yuxiong He

Abstract: With the advancement of machine learning and deep learning, vector search has become instrumental to many information retrieval systems, which find the best matches to user queries based on semantic similarity. These online services require the search architecture to be both effective, with high accuracy, and efficient, with low latency and a small memory footprint, which existing work fails to offer. We develop Zoom, a new vector search solution that collaboratively optimizes accuracy, latency, and memory based…
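For context, a minimal sketch of the exact (brute-force) vector search that approximate systems such as Zoom are designed to replace; the random embeddings, dimensionality, and function name here are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def exact_search(corpus: np.ndarray, query: np.ndarray, k: int = 5):
    """Exact nearest-neighbor search by cosine similarity.

    corpus: (n, d) matrix of embedding vectors.
    query:  (d,) query embedding.
    Returns indices of the k best matches, best first.
    """
    # Normalize rows so that a dot product equals cosine similarity.
    corpus_n = corpus / np.linalg.norm(corpus, axis=1, keepdims=True)
    query_n = query / np.linalg.norm(query)
    scores = corpus_n @ query_n           # (n,) similarity scores
    return np.argsort(-scores)[:k]        # top-k, highest similarity first

# Illustrative usage with random embeddings (n=10_000, d=128).
rng = np.random.default_rng(0)
corpus = rng.standard_normal((10_000, 128)).astype(np.float32)
query = rng.standard_normal(128).astype(np.float32)
print(exact_search(corpus, query, k=5))
```

Each query costs O(n·d) time and the full corpus must be scanned, which is what motivates approximate indexes that trade a little accuracy for much lower latency and memory use.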

Cited by 2 publications (2 citation statements)
References 25 publications
“…By analyzing memory-disk ANNS algorithms [1][2][3][4], we find that when ANNS algorithms are combined with an external disk such as an SSD, they cache edge and point data in the in-memory index as much as possible to enable fast responses to search queries. We use the DiskANN [1] method to build an index on the SIFT10M dataset, with parameters R=64, L=50, and T=16 (R denotes the number of neighbors per vertex in the graph structure, L denotes the number of hops per point when building the index, and T denotes the number of threads used to build the index).…”
Section: Motivation (mentioning)
confidence: 99%
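The quoted statement concerns graph-based memory-disk ANNS indexes. Below is a generic sketch of the greedy candidate-list search such graph indexes (DiskANN among them) typically run at query time; it is not DiskANN's actual implementation, and the function, data layout, and stopping rule are assumptions, with L mirroring the candidate-list parameter from the quote and the adjacency lists bounded by R neighbors per vertex.

```python
import heapq
import numpy as np

def greedy_graph_search(vectors, neighbors, query, entry, L=50, k=10):
    """Generic greedy search over a proximity graph (sketch).

    vectors:   (n, d) array of points; in a memory-disk design these
               would be fetched from SSD, with hot nodes cached in RAM.
    neighbors: adjacency lists with at most R neighbors per vertex.
    L:         size of the candidate list kept during the search.
    """
    def dist(i):
        return float(np.linalg.norm(vectors[i] - query))

    visited = {entry}
    frontier = [(dist(entry), entry)]   # min-heap of unexpanded nodes
    best = [(dist(entry), entry)]       # L best nodes seen so far, sorted
    while frontier:
        d, node = heapq.heappop(frontier)
        # Stop when the closest unexpanded node is already worse than
        # the worst of the L best nodes found so far.
        if len(best) >= L and d > best[-1][0]:
            break
        for nb in neighbors[node]:
            if nb not in visited:
                visited.add(nb)
                d_nb = dist(nb)
                heapq.heappush(frontier, (d_nb, nb))
                best.append((d_nb, nb))
        best.sort()
        del best[L:]                    # truncate to the L best
    return [node for _, node in best[:k]]

# Illustrative usage: random points with a random graph of R=4 neighbors.
rng = np.random.default_rng(0)
pts = rng.standard_normal((1000, 16)).astype(np.float32)
adj = [list(rng.choice(1000, size=4, replace=False)) for _ in range(1000)]
q = rng.standard_normal(16).astype(np.float32)
print(greedy_graph_search(pts, adj, q, entry=0, L=50, k=10))
```

In a memory-disk design, the adjacency lists and vectors touched during this walk would mostly live on SSD, which is why caching hot edges and points in memory, as the quote describes, shortens the search's critical path.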
“…Nearest Neighbor Search (NNS) is a fundamental building block in various application domains [7,8,35,64,67,76,101,110], such as information retrieval [31,111], pattern recognition [26,54], data mining [41,44], machine learning [21,25], and recommendation systems [66,78]. With the explosive growth of dataset scale and the inevitable curse of dimensionality, accurate NNS cannot meet…”
Section: Introduction (mentioning)
confidence: 99%