2015
DOI: 10.2200/s00647ed1v01y201505cac032
|View full text |Cite
|
Sign up to set email alerts
|

Single-Instruction Multiple-Data Execution

Abstract: Synthesis Lectures on Computer Architecture publishes 50-to 100-page publications on topics pertaining to the science and art of designing, analyzing, selecting and interconnecting hardware components to create computers that meet functional, performance and cost goals. e scope will largely follow the purview of premier computer architecture conferences, such as ISCA, HPCA, MICRO, and ASPLOS.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
12
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(12 citation statements)
references
References 68 publications
0
12
0
Order By: Relevance
“…For example, the lengths of SSE, AVX/AVX2 and AVX512 are 128 bits (4 float elements), 256 bits (8 float elements) and 512 bits (16 float elements), respectively. Second, several instructions have been added, notably, gather and scatter instructions [42]. These instructions load/store data of discontinuous positions in memory.…”
Section: Cpu Microarchitectures and Simd Instruction Setsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the lengths of SSE, AVX/AVX2 and AVX512 are 128 bits (4 float elements), 256 bits (8 float elements) and 512 bits (16 float elements), respectively. Second, several instructions have been added, notably, gather and scatter instructions [42]. These instructions load/store data of discontinuous positions in memory.…”
Section: Cpu Microarchitectures and Simd Instruction Setsmentioning
confidence: 99%
“…Single-instruction, multiple-data (SIMD) [41] instruction sets in vector processing units are especially changed. The evolution of the SIMD instructions has taken the form of the increased vector length [42], increased number of types of instructions and decreased latency of instructions. Therefore, it is essential to use SIMD instructions effectively for extracting CPU performance.…”
Section: Introductionmentioning
confidence: 99%
“…This paper proposes a complex variable DSSE solver that supports all features of industrial estimation, including PMU measurements. The implementation is in vectorized code, i.e., it employs loop unrolling and exploits the power of modern processors that posses single instruction multiple data (SIMD) extensions [9].…”
Section: Introductionmentioning
confidence: 99%
“…The compactness of the complex matrix expressions is translated into a computer code that is easily readable, and therefore, more compliant to maintenance and upgrades. More importantly, the framework of complex variable solution is naturally suited to the implementation on modern processors that support single instruction multiple data (SIMD) operations [9], e.g., the fused multiply-accumulate complex variable operations. The DSSE solver in this article is implemented using advanced vector extensions (AVX-2) [30], which benefits from the latest version of code vectorization.…”
Section: Introductionmentioning
confidence: 99%
“…Vectorized programming, however, requires harder constraints than parallel programming in data structures. Vendor's short SIMD architectures, such as MMX, Streaming SIMD Extensions (SSE), Advanced Vector Extensions (AVX)/AVX2, AVX-512, AltiVec, and NEON, are expected to develop rapidly, and vector lengths will become longer [4]. SIMD instruction sets are changed by the microarchitecture of the CPU.…”
mentioning
confidence: 99%