2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
DOI: 10.1109/ipdps.2016.57
PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures

Abstract: Computing k-Nearest Neighbors (KNN) is one of the core kernels used in many machine learning, data mining, and scientific computing applications. Although kd-tree based O(log n) algorithms have been proposed for computing KNN, due to their inherent sequentiality, linear algorithms are used in practice. This limits the applicability of such methods to millions of data points, with limited scalability for Big Data analytics challenges in the scientific domain. In this paper, we present parallel and h…

Cited by 34 publications (15 citation statements); references 12 publications.
“…These edges may be considered (by taking d𝒜 to always be the distance between aggregates), but doing so is computationally infeasible for large graphs. One potential solution to this problem would be to use efficient k-nearest neighbor methods (e.g., Reference 15) to locate geometrically close unconnected aggregates and use such information to ensure that their balls will not overlap.…”
Section: The Two-level Coarse-to-fine Procedures
Confidence: 99%
“…A typical implementation of nearest traversal uses a priority queue based on distances, using the closest node in each iteration. An alternative and better performing approach, first derived for k-d trees in Patwary et al. [2016], is to use a stack. As a stack is a Last-In-First-Out data structure, it is possible to get behavior similar to that of a priority queue by adding the child with the shorter distance second (so that it sits on top of the stack).…”
Section: Traversal for Nearest
Confidence: 99%
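The stack-based traversal described in that citation statement can be sketched in a few lines. This is an illustrative reconstruction, not the cited authors' code: a minimal k-d tree whose nearest-neighbor search replaces the distance-ordered priority queue with a plain list used as a LIFO stack, pushing the closer child last so it is popped first. All class and function names are assumptions for this sketch.

```python
import math

class Node:
    """One k-d tree node: a point, its splitting axis, and two children."""
    def __init__(self, point, axis, left=None, right=None):
        self.point, self.axis = point, axis
        self.left, self.right = left, right

def build_kdtree(points, depth=0):
    """Build a k-d tree by recursively splitting on the median point."""
    if not points:
        return None
    axis = depth % len(points[0])
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2
    return Node(points[mid], axis,
                build_kdtree(points[:mid], depth + 1),
                build_kdtree(points[mid + 1:], depth + 1))

def nearest(root, query):
    """Stack-based nearest-neighbor search (no priority queue)."""
    best, best_d2 = None, math.inf
    stack = [root]
    while stack:
        node = stack.pop()
        if node is None:
            continue
        d2 = sum((a - b) ** 2 for a, b in zip(node.point, query))
        if d2 < best_d2:
            best, best_d2 = node.point, d2
        diff = query[node.axis] - node.point[node.axis]
        near, far = (node.left, node.right) if diff < 0 else (node.right, node.left)
        # Push the far child only if the splitting plane could hide a
        # closer point; push the near child last so the LIFO order
        # mimics a priority queue's closest-first behavior.
        if diff * diff < best_d2:
            stack.append(far)
        stack.append(near)
    return best
```

The LIFO ordering keeps the descent depth-first toward the query, which tightens `best_d2` early and lets the plane-distance check prune most far subtrees before they are ever visited.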
“…The source code for the recommendation system is available on GitHub. 2 The proposed algorithm relies only on minimum Euclidean distances, which can be computed efficiently using distributed tree structures [16,17] or gpu-based implementations [18]. The proposed system is thus horizontally scalable and suitable for distributed applications.…”
Section: Recommendation Algorithm
Confidence: 99%
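The "minimum Euclidean distance" primitive that citation statement refers to reduces to a 1-nearest-neighbor query: find the catalogue point closest to a query point. A brute-force sketch is shown below for clarity; the function name and data layout are assumptions, and at scale this linear scan is exactly what the cited distributed-tree [16,17] and GPU-based [18] methods replace.

```python
import math

def min_euclidean(query, catalogue):
    """Return (index, distance) of the catalogue point nearest to query.

    Brute-force O(n) scan; serves as a reference implementation for the
    tree- or GPU-accelerated searches a production system would use.
    """
    best_i, best_d = -1, math.inf
    for i, point in enumerate(catalogue):
        d = math.dist(query, point)  # Euclidean distance (Python 3.8+)
        if d < best_d:
            best_i, best_d = i, d
    return best_i, best_d
```

Because each query is independent, the scan parallelizes trivially by partitioning the catalogue across workers and taking the minimum of the per-partition results, which is the sense in which the cited system is "horizontally scalable."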
“…Users explore different paths in the space by "liking" or "skipping" tracks. As both the mapping and the search processes are amenable to distributed tree [16,17] and gpu-based [18] parallel searches, the proposed system could be scaled up to accommodate increasing music collections and user bases.…”
Section: Introduction
Confidence: 99%