2011 9th IEEE International Conference on ASIC 2011
DOI: 10.1109/asicon.2011.6157297
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating the data shuffle operations for FFT algorithms on SIMD DSPs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 5 publications
0
5
0
Order By: Relevance
“…Several previous works have fused memory-access and shuffle instructions functionally to accelerate FFTlike algorithms. Reference [7] proposed VHALFUP and VHALFDN instructions that can reduce some of the shuffle operations in the mapping of FFT algorithms, but only the time-domain radix-2 FFT algorithm is considered, and an additional MOV operation is required to store the intermediate results of the two instructions. The VEXC instruction proposed in [8] eliminates the extra MOV operation in [7].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Several previous works have fused memory-access and shuffle instructions functionally to accelerate FFTlike algorithms. Reference [7] proposed VHALFUP and VHALFDN instructions that can reduce some of the shuffle operations in the mapping of FFT algorithms, but only the time-domain radix-2 FFT algorithm is considered, and an additional MOV operation is required to store the intermediate results of the two instructions. The VEXC instruction proposed in [8] eliminates the extra MOV operation in [7].…”
Section: Related Workmentioning
confidence: 99%
“…Reference [7] proposed VHALFUP and VHALFDN instructions that can reduce some of the shuffle operations in the mapping of FFT algorithms, but only the time-domain radix-2 FFT algorithm is considered, and an additional MOV operation is required to store the intermediate results of the two instructions. The VEXC instruction proposed in [8] eliminates the extra MOV operation in [7]. However, it adds an extra register write port, and considers only the time-domain radix-2 FFT algorithm, thus has no acceleration effect on other types of FFT algorithms.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…As for implementing Fast Fourier Transforms (FFTs) on Single Instrustion Multiple Data (SIMD) instruction digital signal processing (DSP) processors [6,14,17,21], to the best of our knowledge, no such work is performed on BP polar code decoders. Therefore, our work provides not only a novel hardware solution, but also a novel approach for the implementation of BP polar code decoders on DSPs, which allow the definition of application specific SIMD instructions for Polar code decoding, like in the Cadence Tenscilica ConnX family DSPs.…”
Section: Related Workmentioning
confidence: 99%