Proceedings of the 46th International Symposium on Computer Architecture 2019
DOI: 10.1145/3307650.3322249
|View full text |Cite
|
Sign up to set email alerts
|

Scalable interconnects for reconfigurable spatial architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
2
2
2

Relationship

1
5

Authors

Journals

citations
Cited by 21 publications
(9 citation statements)
references
References 48 publications
0
9
0
Order By: Relevance
“…We evaluate Capstan's performance using a custom cycleaccurate C++ simulator; our simulator models the effects of a hybrid static-dynamic network [75] and uses Ramulator [38] to model DRAM behavior. The simulator is validated against microbenchmarks with known performance characteristics.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations

Capstan: A Vector RDA for Sparsity

Rucker,
Vilim,
Zhao
et al. 2021
Preprint
Self Cite
“…We evaluate Capstan's performance using a custom cycleaccurate C++ simulator; our simulator models the effects of a hybrid static-dynamic network [75] and uses Ramulator [38] to model DRAM behavior. The simulator is validated against microbenchmarks with known performance characteristics.…”
Section: Discussionmentioning
confidence: 99%
“…However, due to halo exchange, convolution maps poorly to positional dataflow: in a streaming-positional architecture, each tile's accumulation buffer would need eight links (input and output) to neighboring tiles. Although we can map convolution to the shuffle network using Spatial (using 100% of the on-chip shuffle re- sources), using the dynamic network [75] in a non-positional (i.e., out-of-order) mode yields 3.8 times higher performance. Without manual mapping, Capstan still out-performs a CPU and GPU; however, manual mapping is used to compare against SCNN (which uses a similar tiled architecture).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations

Capstan: A Vector RDA for Sparsity

Rucker,
Vilim,
Zhao
et al. 2021
Preprint
Self Cite
“…Other open research challenges in the design of accelerators include supporting software definable interconnects and logic blocks [317] with run-time reconfiguration and dynamic resource allocation. In contrast to an FPGA, a GPU can switch between threads at run-time and thus can be instantaneously reconfigured to run different tasks.…”
Section: ) Summary Of Acceleratorsmentioning
confidence: 99%