“…Studies have demonstrated the capabilities of these techniques, particularly deep convolutional networks (CNN; LeCun et al, 1998;Lecun et al, 2015) in detecting and classifying fish in underwater images or video streams (Salman et al, 2016;Villon et al, 2018, see reviews in Goodwin et al, 2022;Li and Du, 2022;Mittal et al, 2022;Saleh, Sheaves and Rahimi Azghadi, 2022). These studies have utilized different types of image databases and have faced similar unresolved questions, such as the number of fish needed for training ( Marrable et al, 2022), the need for color image pre-processing (e.g., Lisani et al, 2022), the need for transfer learning from large databases (e.g., Imagenet or coco), improving results when working with small image areas or limited computing power (Paraschiv et al, 2022), whether to use segmentation of bounding boxes and how well a trained set will perform for different habitats (backgrounds). In particular, the detection and classification of multiple species using different combinations of backgrounds (the "domain-shift" phenomenon: Kalogeiton et al, 2016;Ditria et al, 2020), number of species, and labeling quality, is an area that requires further investigation.…”