The proliferation of grid-connected photovoltaic (PV) systems has generated considerable apprehension among power system operators due to worries about electricity quality, leading to the implementation of increasingly strict standards and regulations. Inter-harmonics and DC offset have emerged as prominent power quality issues in grid-connected photovoltaic (PV) systems, constituting significant obstacles. This article provides a thorough examination of the methods used to improve the performance of a three-phase grid-connected photovoltaic (PV) system, with a specific focus on mitigating inter-harmonics and DC offset. The presence of inter-harmonics and DC offset may have a substantial negative impact on the overall performance of a system, resulting in compromised power quality and diminished energy extraction capabilities. In order to address these challenges, a method known as ensembled Deep Reinforcement learning (EDRL) Maximum electricity Point Tracking (MPPT) is used to optimize the extraction of electricity from the photovoltaic (PV) array. Furthermore, the integration of a Coati Optimization Algorithm (COA) with a fuzzified Phase-Locked Loop (PLL) synchronization mechanism is used to ensure precise synchronization with the grid. The EDRL MPPT approach demonstrates a proficient ability to accurately monitor and follow the maximum power point of the photovoltaic (PV) array. This is achieved by using a reward system that is based on the lowest overall harmonic distortion in the grid current. The COA (Centralized Optimization Algorithm) is used to effectively tune the hyperparameters of the fuzzy system. The primary objective of this optimization process is to reduce the DC offset, hence ensuring a steady and precise synchronization between the fuzzy system and the grid. The efficacy of the proposed system is assessed by means of comprehensive simulations and experimental validation. The findings of this study provide evidence supporting the efficacy of the Enhanced Distributed Reactive Load Maximum Power Point Tracking (EDRL MPPT) approach in optimizing power extraction and reducing the impact of inter-harmonics. The COA-fuzzified-PLL synchronization system is designed to provide precise grid synchronization while mitigating the adverse effects of a 2.89% total harmonic distortion (THD) in grid current, particularly the influence of direct current (DC) offset. The integration of many approaches presents notable improvements in terms of power quality, energy extraction efficiency, and system stability.