Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays 2024
DOI: 10.1145/3626202.3637569
|View full text |Cite
|
Sign up to set email alerts
|

SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration

Jinming Zhuang,
Zhuoping Yang,
Shixin Ji
et al.

Abstract: With the increase in the computation intensity of the chip, the mismatch between computation layer shapes and the available computation resource significantly limits the utilization of the chip. Driven by this observation, prior works discuss spatial accelerators or dataflow architecture to maximize the throughput. However, using spatial accelerators could potentially increase the execution latency. In this work, we first systematically investigate two execution models: (1) sequentially (temporally) launch one… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
references
References 33 publications
0
0
0
Order By: Relevance