“…Similarly, ( Dong et al, 2021 ) proposed an MCIF-Net framework that integrates a large receptive field and an effective feature aggregation strategy into a unified framework to extra rich context features for accurate COD. In addition to existing literature, recent advancements, and relevant studies, such as the notable works of ( Hussain et al, 2021 ; Qadeer et al, 2022 ; Naqvi et al, 2023 ), contribute to the understanding of object detection, tracking, and recognition in various contexts, enhancing the breadth and depth of the related literature. Despite research devoted to the challenges in the field of COD to achieve out-standing performance in terms of accuracy, existing deep learning-based COD methods suffer major limitations such as weak boundaries (i.e., edges), low boundary contrast, variations in object appearances, such as object size and shape, leading to unsatisfactory segmentation performance ( Fan et al, 2020a ; Mei et al, 2021 ; Ji et al, 2022 ), and raises the demands of more advanced feature fusion strategies.…”