2006
DOI: 10.1109/sips.2006.352558
|View full text |Cite
|
Sign up to set email alerts
|

Performance Evaluation of an SIMD Architecture with a Multi-bank Vector Memory Unit

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2008
2008
2017
2017

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…Additional sub-word data manipulation instructions are needed to prepare vector operands [43] for vector ALUs. We have implemented a three operand shuffle instruction that can generate the output as any combination of the sub-words of two input operands, while the third operand, sets selection criteria either through an immediate value or a register as seen in Figure 6.…”
Section: E Ex-stage: Alumentioning
confidence: 99%
“…Additional sub-word data manipulation instructions are needed to prepare vector operands [43] for vector ALUs. We have implemented a three operand shuffle instruction that can generate the output as any combination of the sub-words of two input operands, while the third operand, sets selection criteria either through an immediate value or a register as seen in Figure 6.…”
Section: E Ex-stage: Alumentioning
confidence: 99%
“…Although the SIMdD supports non-aligned and irregular data access efficiently, it is based on costly multi-port memory. The other memory architectures such as multibank scratch-pad memory and dual-bank cache memory are studied in [8] [5]. The multi-bank memory units perform non-aligned and stride access efficiently, while the dual-bank cache can efficiently support only non-aligned access.…”
Section: Related Workmentioning
confidence: 99%
“…x [8] x+7 x+3 x+0 x+8 Figure 6: Index register-based 4-port memory compiler framework. Although this scheme needs an extra software overhead, its operation at runtime is simple and efficient because it provides a separate address space to the packing buffer and there is no need of concerning the coherency problem.…”
Section: Irregular Data Access and Hard-ware Supportmentioning
confidence: 99%
See 1 more Smart Citation