Over 13% of the global population (most of which are rural communities) still lack access to electricity. A typical resolution to this would be to generate more electricity from existing power generation infrastructure. However, the urgency to meet net-zero global greenhouse gas emissions means that this resolution may not be the way forward. Instead, policymakers must consider decarbonization strategies such as renewable energy systems to generate more electricity in rural communities. As policymakers aim to encourage renewable energy generation, existing power plant operators may not share the same perspective. Operators typically wish to ensure profit margins in their operations as decarbonization efforts may be costly and reduce the profit. A balance must be struck between both parties so that the energy sector can continue to meet rising energy demands and decarbonization needs. This is a classic leader–follower situation where it involves the interplay between policymaker (as energy sector regulator) and industry (as energy sector investor). This work presents a bi-level optimization model to address the leader–follower interactions between policymakers and industry operators. The proposed model considers factors such as total investment, co-firing opportunities, incentives, disincentives, carbon emissions, scale, cost, and efficiency to meet electricity demands. To demonstrate the model, two Malaysian case studies were evaluated and presented. The first optimized networks is developed based on different energy demands. Results showed that when cost was minimized, the production capacity of the existing power plants was increased and renewable energy systems were not be selected. The second case study used bi-level optimization to determine an optimal trade-off $ 1.4 million in incentives per year, which serves as a monetary sum needed by policymakers to encourage industry operators to decarbonize their operations. Results from the second case were then compared to the ones in the first.