“…In dynamic scenarios, DRL was applied to solve the routing, modulation, and spectrum assignment (RMSA) problem in single-domain EONs [27,28,30,31], multidomain EONs [32], multiband EONs [33,34] ,and survivable EONs operating under shared protection [35]; the problem of energy-efcient trafc grooming in fog-cloud EONs [36], the problem of establishing and reconfguring multicast sessions in EONs [37], the fragmentation mitigation problem [38], and the resource allocation problem with advanced reservation (AR) in EONs for cloud-edge computing [39]. Only one previous work has studied the application of DRL on MCF networks [40], but this work focused on fxed-grid networks.…”