Cell-free networking enables full cooperation among distributed access points (APs). This paper focuses on reducing the long-term energy consumption of a cell-free network in the downlink integrated data and energy transfer (IDET) for achieving energy sustainability. The resultant design includes both the AP classification on a large time-scale and the beamforming of the APs on a small time-scale in order to simultaneously satisfy the IDET requirements of data users and energy users. For dealing with binary integer actions (AP classification) and continuous actions (beamforming) together, we innovatively propose a stable double parameterized deep-Q-network (DP-DQN), which can be enhanced by a digital twin (DT) running in the intelligent core processor (ICP) so as to achieve faster and more stable convergence. Therefore, the cell-free network may avoid suffering from performance fluctuation during the training process. The simulation results demonstrate that our DP-DQN exceeds in convergence compared to other benchmarks while guaranteeing an optimal solution. Index Terms-Cell-free networking, integrated data and energy transfer (IDET), mixed time-scale, beamforming, AP classification, deep reinforcement learning (DRL), double parameterized deep-Q-network (DP-DQN), digital twin (DT).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.