“…However, network resources may be over-consumed during the training and data transmission process. To solve the complex and dynamic control issues, a federated deep reinforcement learning-based [101,107,108,118,123,133,134,136,144,145,154,159,161,165,167,168,171], [174-177, 179, 183, 188, 192, 198, 200], [204,208,210,213,216,221,222,[225][226][227] [177-179, 181, 187, 188, 190, 196, 197, 200, 201, 203, 206, 207, 211, 214, 218, 223], [234,236,239,243,245,248,252,254,258,275,[277][278][279], [284, 286-292, 294, 297, 300], [305,306, Wireless, radio, antenna, signal [14,18,28…”