Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays 2013
DOI: 10.1145/2435264.2435292
|View full text |Cite
|
Sign up to set email alerts
|

Architectural enhancements in Stratix V™

Abstract: This paper describes architectural enhancements in the Altera Stratix-V™ FPGA architecture, built on a 28nm TSMC process, together with the data supporting those choices. Among the key features are time borrowing flip-flops, a doubling of the number of flip-flops per LUT compared to previous Stratix architectures, a simplified embedded 20kb dual-port RAM block, and error correction that can correct up to 8 adjacent errors. Arithmetic performance is significantly improved using a fast adder with two levels of m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 52 publications
(17 citation statements)
references
References 21 publications
1
13
0
Order By: Relevance
“…For example, Altera provides logic clusters that support multiple clocks (whereas Xilinx only supports one per cluster) as well as fine-grained time borrowing for flip-flops [14]. Lastly, we would like to investigate how to apply negotiated congestion into our routing algorithms.…”
Section: Future Workmentioning
confidence: 99%
“…For example, Altera provides logic clusters that support multiple clocks (whereas Xilinx only supports one per cluster) as well as fine-grained time borrowing for flip-flops [14]. Lastly, we would like to investigate how to apply negotiated congestion into our routing algorithms.…”
Section: Future Workmentioning
confidence: 99%
“…We have chosen a depopulated crossbar as this is common in most commercial devices [2], [5]. The depopulated crossbar is composed of four, smaller, fully populated crossbars as designed by Chiasson in [13]; this depopulation results in the soft logic block inputs being divided into four groups of ten logically equivalent pins.…”
Section: Base Architecture Modelmentioning
confidence: 99%
“…The next 3 circuits (bgm, boundtop and LU32PEEng) have reductions in the critical path LUT depth of more than 20% when targeting the U-CLA architecture, even though no hard adders occur on their critical paths. This indicates that adder logic was likely timing critical in the Soft architecture 2 , but has sped up enough to move off the critical path in the U-CLA architecture. Interestingly, while these 3 circuits have an average LUT depth that is 28% lower when targeting U-CLA vs. Soft, only LU32PEEng speeds up significantly, and the average delay reduction across the 3 designs is only 7%.…”
Section: E Circuit-by-circuit Breakdownmentioning
confidence: 99%
See 2 more Smart Citations