With flexibility, convenience and mobility, unmanned aerial vehicles (UAVS) can provide wireless communication networks with lower costs, easier deployment, higher network scalability and larger coverage. This paper proposes the deep deterministic policy gradient algorithm to jointly optimize the power allocation and flight trajectory of UAV with constrained effective energy to maximize the downlink throughput to ground users. To validate the proposed algorithm, we compare with the random algorithm, Q-learning algorithm and deep Q network algorithm. The simulation results show that the proposed algorithm can effectively improve the communication quality and significantly extend the service time of UAV. In addition, the downlink throughput increases with the number of ground users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.