2018
DOI: 10.1145/3200691.3178522
|View full text |Cite
|
Sign up to set email alerts
|

Shared-memory parallelization of MTTKRP for dense tensors

Abstract: e matricized-tensor times Khatri-Rao product (MTTKRP) is the computational bo leneck for algorithms computing CP decompositions of tensors. In this paper, we develop shared-memory parallel algorithms for MTTKRP involving dense tensors. e algorithms cast nearly all of the computation as matrix operations in order to use optimized BLAS subroutines, and they avoid reordering tensor entries in memory. We benchmark sequential and parallel performance of our implementations, demonstrating high sequential performance… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…The form of the derivative is a matricized tensor times Khatri-Rao product (MTTKRP) with the tensor Y and the Khatri-Rao product Z k . The MTTKRP is the dominant kernel in the standard CP computation in terms of computation time and has optimized high-performance implementations [7,56,40,31]. In the dense case, the MTTKRP costs O(rn d ).…”
Section: Gcp Gradientmentioning
confidence: 99%
“…The form of the derivative is a matricized tensor times Khatri-Rao product (MTTKRP) with the tensor Y and the Khatri-Rao product Z k . The MTTKRP is the dominant kernel in the standard CP computation in terms of computation time and has optimized high-performance implementations [7,56,40,31]. In the dense case, the MTTKRP costs O(rn d ).…”
Section: Gcp Gradientmentioning
confidence: 99%
“…Our proposed algorithm uses dimension trees, but we also benchmark our implementation without that optimization to highlight its importance. We use an existing implementation to perform the individual MTTKRPs [36] with this approach.…”
Section: Comparison Implementationsmentioning
confidence: 99%