“…From the brief introduction of several commonly used models described above, we know that deep-learning technology has a powerful role in image classification, regression and segmentation. Furthermore, there are also many other kinds of network structures that are applied, such as single shot multibox detection (SSD) 56 , long short-term memory (LSTM) 32 , you only look once (YOLO, source code is available at: http://pjreddie.com/yolo/ ) 57 , regions-CNN (R-CNN) 58 , fast region-based CNNs (Fast R-CNN, source code is available at: https://arxiv.org/abs/1504.08083 ) 33 , faster region-based CNNs (Faster-RCNN, source code is available at: https://github.com/shaoqingren/faster_rcnn (in MATLAB) and at https://github.com/rbgirshick/py-faster-rcnn (in Python)) 44 , 59 , 60 , and so on. In addition, the processed data types are not simply limited to RGB images but can also include any other data forms, such as video, hyperspectral images, and spectral data.…”