In metropolitan areas, traffic jams on city streets are a major source of annoyance and financial losses. Recent advancements in data processing algorithms and the widespread availability of traffic detectors have made it possible to implement data-driven strategies for reducing traffic congestion. In order to benefit from intersection cooperation in this setting, this paper presents a distributed control strategy based on RL. In this scenario, traffic prediction software's embedding that takes into account the state of nearby junctions is used to synthesize an RL controller that controls the traffic lights. Loop detector characteristics are insufficient for precise data imputed in sophisticated traffic control systems. Most current imputation methods only use these extracted characteristics, which leads to the creation of data replicas that lack the necessary precision. The clean data are first given a statistical multi-class label, with classes ranging from C1 to Cn. Then, using a deep recurrent neural network (RNN) model, the best data model is created from the labelled spotless data and applied to the class of models in the missed-volume data. Results from simulations using TRANSYT demonstrate that the suggested strategy outperforms conventional methods in terms of waiting times and other important presentation indices.