This study investigates the use of Multi-Agent Reinforcement Learning (MARL) to enhance the efficiency of power system operation and control. The simulated power system environment is represented as a multi-agent system, where intelligent agents are used to mimic generators and loads. The MARL framework utilizes Q-learning algorithms to allow agents to independently adjust their activities in accordance with changing operating circumstances. The resulting simulated data represents a wide-ranging power grid scenario, including buses with different generator capacity, load needs, and transmission line capacities. The findings indicate a significant improvement in the stability of the system via Multi-Agent Reinforcement Learning (MARL), since the agents’ capacity to learn and adapt enables them to quickly alter the outputs of generators and meet the needs of the load, so ensuring that voltage and frequency levels remain within acceptable limits. The MARL framework significantly improves economic efficiency by enabling actors to optimize their behaviors in order to reduce the total costs of the system. The agility of the MARL-based control method is emphasized by the decrease in response time to dynamic disturbances, as agents demonstrate quick and efficient reactions to unforeseen occurrences. The favorable results highlight the potential of MARL as a decentralized decision-making model in power systems, providing advantages in terms of stability, economic efficiency, and the capacity to respond to disruptions. Although the research uses artificial data in a controlled setting, the observed enhancements indicate the flexibility and efficacy of the MARL framework. Future research should prioritize the integration of more practical situations and tackling computational obstacles to further confirm the suitability and expandability of Multi-Agent Reinforcement Learning (MARL) in actual power systems.