Compared with natural images, underwater images are usually degraded with blur, scale variation, colour shift and texture distortion, which bring much challenge for computer vision tasks like object detection. In this case, generic object detection methods usually fail to achieve satisfactory performance. The main reason is considered that the current methods lack sufficient discriminativeness of feature representation for the degraded underwater images. A a novel multi-scale feature representation and interaction network for underwater object detection is proposed, in which two core modules are elaborately designed to enhance the discriminativeness of feature representation for underwater images. The first is the Context Integration Module, which extracts rich context information from high-level features and is integrated with the feature pyramid network to enhance the feature representation in a multi-scale way. The second is the Dual-refined Attention Interaction Module, which further enhances the feature representation by sufficient interactions between different levels of features both in channel and spatial domains based on attention mechanism. The proposed model is evaluated on four public underwater datasets. The experimental results compared with state-of-the-art object detection methods show that the proposed model has leading performance, which verifies that it is effective for underwater object detection. In addition, object detection experiments on a foggy dataset of Real-world Task-driven Testing Set (RTTS) and the natural image dataset of pattern analysis statistical modelling and computational learning, visual object classes (PASCAL VOC) are conducted. The results show that the proposed model can be applied on the degraded dataset of RTTS but fails on PASCAL VOC.
K E Y W O R D S computer vision, convolutional neural nets, object detection
| INTRODUCTIONAs a fundamental and important problem in computer vision, object detection has been continuously investigated in the past decades. With the development of deep learning in recent years, object detection has made great progression and been widely applied in various real-world applications, such as vehicle detection [1-4], pedestrian detection [5-9], intelligent monitoring [10-13], etc.Although the generic object detection algorithms have promising performance in most cases, they often fail to achieve convincing results under some specific scenes, such as underwater, foggy and dark light, etc. Compared with the common scenarios, the object detection in these specific scenes faces much challenge and difficulty to overcome, including image blur, texture distortion, imbalanced illumination, etc. For this purpose, many researchers began and focussed on the object detection in specific scenes. For example, underwater object detection [14] has raised much attention recently, as it is an important and common problem in the fields of seafood industry, underwater engineering and ocean development. Specifically, for the aquaculture industry, underwater robots ...