“…We use the notation $\mathbf{h} = [h_1, h_2, \ldots, h_L]$ to represent a network with $L$ hidden layers, where $h_l$ denotes the number of neurons in fully connected layer $l$, or the number of kernels in convolutional layer $l$. In recent works that apply DNNs to decode ECCs, the training set explodes rapidly as the source word length grows. For example, with a rate-0.5 $(n = 1024, k = 512)$ ECC, one epoch would have to cover $2^{512}$ possible codewords of length 1024, which makes training prohibitively complex and renders DNN-based decoding difficult to implement in practical systems [28], [29], [31], [32]. However, we note that this problem does not arise in FL CS decoding, since CS source words are typically considerably shorter, possibly only up to a few dozen symbols [1], [6]–[17].…”
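A minimal sketch (not from the paper) of why exhaustive training-set enumeration is feasible only for short codes: the codebook of a linear $(n, k)$ code has $2^k$ entries, so it can be enumerated for a toy code but not for $k = 512$. The generator matrix and function name below are illustrative, not taken from the cited works.

```python
# Illustrative sketch, assuming a systematic generator-matrix encoder.
import itertools
import numpy as np

def all_codewords(G):
    """Enumerate every codeword of a linear code with k x n generator matrix G."""
    k, n = G.shape
    for bits in itertools.product([0, 1], repeat=k):
        # Encode each of the 2^k possible messages over GF(2).
        yield (np.array(bits) @ G) % 2

# Toy (n=7, k=4) Hamming code: 2^4 = 16 codewords, trivially enumerable.
G_hamming = np.array([
    [1, 0, 0, 0, 1, 1, 0],
    [0, 1, 0, 0, 1, 0, 1],
    [0, 0, 1, 0, 0, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
])
print(sum(1 for _ in all_codewords(G_hamming)))  # 16

# For the rate-0.5 (n=1024, k=512) ECC in the text, the codebook holds
# 2**512 codewords -- far beyond any feasible training set.
print(f"2^512 = {2**512:.3e}")  # about 1.341e+154
```

By contrast, a fixed-length CS source word of a few dozen symbols keeps the space of inputs small enough for a DNN decoder to see representative coverage during training.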