Kernel matrices are ubiquitous in machine learning and non-parametric statistics. Given N points in d dimensions and a kernel function that requires O(d) work per evaluation, we present an O(dN log N)-work algorithm for the approximate factorization of a regularized kernel matrix, a common computational bottleneck in the training phase of a learning task. With this factorization, solving a linear system with a kernel matrix requires only O(N log N) work. Our algorithm requires only kernel evaluations and does not require that the kernel matrix admit an efficient global low-rank approximation; instead, the factorization assumes low-rank structure only in the off-diagonal blocks, under an appropriate row and column ordering. We also present a hybrid method that, when the full factorization is prohibitively expensive, combines a partial factorization with iterative methods. As a highlight, we approximately factorize a dense 11M × 11M kernel matrix in 2 minutes on 3,072 x86 "Haswell" cores, and a 4.5M × 4.5M matrix in 1 minute on 4,352 "Knights Landing" cores.
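To make the structural assumption concrete, the following is a minimal one-level NumPy sketch (not the paper's parallel implementation) of the idea: keep the diagonal blocks of the regularized kernel matrix dense, compress the off-diagonal block to low rank after ordering the points, and solve via the Woodbury identity. The kernel choice, problem sizes, and rank `r` are illustrative assumptions; the actual algorithm recurses hierarchically on the diagonal blocks and runs in parallel, which this sketch omits.

```python
import numpy as np

def rbf_kernel(X, Y, bandwidth=1.0):
    # Gaussian kernel; each entry costs O(d) work, matching the abstract.
    sq = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq / (2.0 * bandwidth**2))

def low_rank(A, r):
    # Rank-r truncated SVD of an off-diagonal block: A ~= U @ V.T.
    Uf, s, Vt = np.linalg.svd(A, full_matrices=False)
    return Uf[:, :r] * s[:r], Vt[:r].T

rng = np.random.default_rng(0)
N, d, lam, r = 2000, 1, 1e-2, 30           # illustrative sizes, not the paper's
X = rng.standard_normal((N, d))
X = X[np.argsort(X[:, 0])]                  # the row/column ordering is what
n1 = N // 2                                 # makes off-diagonal blocks low-rank
X1, X2 = X[:n1], X[n1:]

# Diagonal blocks stay dense; only the off-diagonal block is compressed.
D1 = rbf_kernel(X1, X1) + lam * np.eye(n1)
D2 = rbf_kernel(X2, X2) + lam * np.eye(N - n1)
U12, V12 = low_rank(rbf_kernel(X1, X2), r)  # K12 ~= U12 @ V12.T

# Assemble K + lam*I ~= blkdiag(D1, D2) + U @ V.T (symmetric: K21 = K12.T).
Z1, Z2 = np.zeros((n1, r)), np.zeros((N - n1, r))
U = np.block([[U12, Z1], [Z2, V12]])
V = np.block([[Z1, U12], [V12, Z2]])

def block_diag_solve(B):
    # Apply blkdiag(D1, D2)^{-1} to the rows of B.
    return np.vstack([np.linalg.solve(D1, B[:n1]), np.linalg.solve(D2, B[n1:])])

# Woodbury identity:
# (D + U V^T)^{-1} b = D^{-1} b - D^{-1} U (I + V^T D^{-1} U)^{-1} V^T D^{-1} b
b = rng.standard_normal((N, 1))
Db, DU = block_diag_solve(b), block_diag_solve(U)
core = np.eye(2 * r) + V.T @ DU
x = Db - DU @ np.linalg.solve(core, V.T @ Db)

# Sanity check against the dense solve.
x_dense = np.linalg.solve(rbf_kernel(X, X) + lam * np.eye(N), b)
print("relative error:", np.linalg.norm(x - x_dense) / np.linalg.norm(x_dense))
```

Once the low-rank factors are formed, the solve touches only the diagonal blocks and the thin factors, which is what drives the cost below the O(N^2) of a dense solve; applying the same splitting recursively to D1 and D2 is what yields the O(N log N) behavior claimed above.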