2016 9th International Symposium on Turbo Codes and Iterative Information Processing (ISTC) 2016
DOI: 10.1109/istc.2016.7593092
|View full text |Cite
|
Sign up to set email alerts
|

Beyond Gbps Turbo decoder on multi-core CPUs

Abstract: This paper presents a high-throughput implementation of a portable software turbo decoder. The code is optimized for traditional multi-core CPUs (like x86) and it is based on the Enhanced max-log-MAP turbo decoding variant. The code follows the LTE-Advanced specification. The key of the high performance comes from an inter-frame SIMD strategy combined with a fixed-point representation. Our results show that proposed multi-core CPU implementation of turbo-decoders is a challenging alternative to GPU implementat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 19 publications
0
6
0
Order By: Relevance
“…In order to study AVX impact on power and energy consumption, we target four different HPC applications, which have automatic vectorization (by setting a compilation flag or an environment variable). We used applications with different CPU behavior and/or available options: HPL 15 and Plasma svd 16 which use the Math Kernel Library (MKL), AFF3CT 17 and SToRM 18 . Note that AFF3CT and SToRM only use integer.…”
Section: Applicationsmentioning
confidence: 99%
“…In order to study AVX impact on power and energy consumption, we target four different HPC applications, which have automatic vectorization (by setting a compilation flag or an environment variable). We used applications with different CPU behavior and/or available options: HPL 15 and Plasma svd 16 which use the Math Kernel Library (MKL), AFF3CT 17 and SToRM 18 . Note that AFF3CT and SToRM only use integer.…”
Section: Applicationsmentioning
confidence: 99%
“…and 4G LTE standards and may be candidates for the future 6G [38]. Their decoding algorithm consists of tracking/routing two mesh networks which are connected to a data interleaver [15], [39]. Fixed data widths of 8 or 16 bits are used.…”
Section: Ecc Decoding Algorithmsmentioning
confidence: 99%
“…It's obviously that the others suffer from problems of too low throughput or too high latency, or both. Take work in [5] for example, although an excellent decoding throughput has been achieved, the decoding latency is too long to be tolerated. We also can see that our design provides a trade-off between latency and throughput.…”
Section: Optimized Decoding Algorithmmentioning
confidence: 99%
“…By fully utilizing the enormous parallelism of GPUs, many codeword blocks can be decoded simultaneously. The work in [5] implements a turbo decoder by employing high-end CPUs. However, their implementations may suffer from some problems, either with too low throughput or too high latency.…”
Section: Introductionmentioning
confidence: 99%