Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures 2000
DOI: 10.1145/341800.341819
|View full text |Cite
|
Sign up to set email alerts
|

Algorithmic foundations for a parallel vector access memory system

Abstract: This paper presents mathematical foundations for the design of a memory controller subcomponent that helps to bridge the processor/memory performance gap for applications with strided access patterns. The Parallel Vector Access (PVA) unit exploits the regularity of vectors or streams to access them efficiently in parallel on a multi-bank SDRAM memory system. The PVA unit performs scatter/gather operations so that only the elements accessed by the application are transmitted across the system bus. Vector operat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2002
2002
2018
2018

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 18 publications
0
5
0
Order By: Relevance
“…Many methods for multiple accesses to memory for parallel and distributed operations of an algorithm have been reported. [26][27][28][29][30][31][32][33][34][35][36][37] In our work, a data management unit (DMU) is used to schedule all requests for transferring data between the FIFO buffers and on-chip memory. The DMU consists mainly of multiple address generation units (AGUs) used to control reading and writing access to on-chip memory.…”
Section: Data Management Unitmentioning
confidence: 99%
“…Many methods for multiple accesses to memory for parallel and distributed operations of an algorithm have been reported. [26][27][28][29][30][31][32][33][34][35][36][37] In our work, a data management unit (DMU) is used to schedule all requests for transferring data between the FIFO buffers and on-chip memory. The DMU consists mainly of multiple address generation units (AGUs) used to control reading and writing access to on-chip memory.…”
Section: Data Management Unitmentioning
confidence: 99%
“…Classic vector machines such as the Cray-1 [61,50] overcome the inefficiencies of DRAM overfetch and access granularity by using massive bank-switching to offer wordgranularity accesses. However, vector core designs and memory controllers are costly due to their limited market and sizable engineering costs [24].…”
Section: Related Workmentioning
confidence: 99%
“…An extension of the work presented in [9] is proposed by Mathew et al in [10,11] where the authors proposed a Parallel Vector Access unit (PVA). This unit is a vector memory subsystem that provided strided data accesses with high efficiency on a multi-module SDRAM memory.…”
Section: Background and Related Workmentioning
confidence: 99%