2019 34th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC) 2019
DOI: 10.1109/itc-cscc.2019.8793377
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Implementation of Strassen's Algorithm for Memory Allocation using AVX Intrinsic on Multi-core Architecture

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 8 publications
0
1
0
Order By: Relevance
“…In Strassen's algorithm, there are seven submatrices-calculations (M 1 to M 7) that are independent works that can be executed by parallel section constructs. The implementation in [31] only presented the performance of Strassen's algorithm in terms of GF LOP S and speed-up computation excluding the measurement of energy/power consumption. The most suitable recursive stop-point was proposed in [31], and its tiling pattern was different from our proposed 2D blocking method, as shown in Fig.…”
Section: Avx Vectorization and Openmpmentioning
confidence: 99%
“…In Strassen's algorithm, there are seven submatrices-calculations (M 1 to M 7) that are independent works that can be executed by parallel section constructs. The implementation in [31] only presented the performance of Strassen's algorithm in terms of GF LOP S and speed-up computation excluding the measurement of energy/power consumption. The most suitable recursive stop-point was proposed in [31], and its tiling pattern was different from our proposed 2D blocking method, as shown in Fig.…”
Section: Avx Vectorization and Openmpmentioning
confidence: 99%