Transformer-Empowered 6G Intelligent Networks: From Massive MIMO Processing to Semantic Communication

Wang, Yan; Gao, Z.; Zheng, Dezhi; Chen, Sheng; Gündüz, Deniz; Poor, H. Vincent

doi:10.48550/arxiv.2205.03770

Cited by 1 publication

(2 citation statements)

References 12 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In 2017, the authors of [18] introduced a pure-attention-based DNN architecture, dubbed "transformer", to replace the prevalent RNN models for machine translation tasks. Since then, pure-attention-based DNN models are getting more and more popular in the DL community, while also finding applications in wireless communications [27].…”

Section: A Attentionnetmentioning

confidence: 99%

See 1 more Smart Citation

AttentionCode: Ultra-Reliable Feedback Codes for Short-Packet Communications

Shao¹,

Ozfatura²,

Perotti³

et al. 2022

Preprint

View full text Add to dashboard Cite

Ultra-reliable short-packet communication is a major challenge in future wireless networks with critical applications. To achieve ultra-reliable communications beyond 99.999%, this paper envisions a new interaction-based communication paradigm that exploits the feedback from the receiver for the sixth generation (6G) communication networks and beyond. We present AttentionCode, a new class of feedback codes leveraging deep learning (DL) technologies. The underpinnings of Atten-tionCode are three architectural innovations: AttentionNet, input restructuring, and adaptation to fading channels, accompanied by several training methods, including large-batch training, distributed learning, look-ahead optimizer, training-test signalto-noise ratio (SNR) mismatch, and curriculum learning. The training methods can potentially be generalized to other wireless communication applications with machine learning. Numerical experiments verify that AttentionCode establishes a new state of the art among all DL-based feedback codes in both additive white Gaussian noise (AWGN) channels and fading channels. In AWGN channels with noiseless feedback, for example, AttentionCode achieves a block error rate (BLER) of 10 −7 when the forward channel SNR is 0 dB for a block size of 50 bits, demonstrating the potential of AttentionCode to provide ultra-reliable short-packet communications for 6G.

show abstract

Section: A Attentionnetmentioning

confidence: 99%

“…The update rule of ( 27) is derived for the basic optimizer that performs mini-batch gradient descent. In this case, distributed learning in (28) is equivalent to (27). On the other hand, advanced optimizers, such as Adam or AdaMax [31], often take the momentum of gradients and adaptive learning rates into account.…”

Section: B Distributed Learningmentioning

confidence: 99%