Background
Medical images have already become an essential tool for the diagnosis of many diseases. Thus a large number of medical images are being generated due to the daily routine inspection. An efficient image‐based disease retrieval system will not only make full use of existing data, but also help physicians to prognosis the diseases. Medical image retrieval is represented by the classification and localization of common thorax diseases in x‐ray images. Although extensive efforts have been put into this field, there are still many challenges.
Purpose
Most of the existing fine‐grained image research methods just apply existing deep learning frameworks in extracting the image features. However, these high‐level features mainly focus on the global representations of the object, rather than simultaneously considering the local ones. It requires fine‐grained details to classify the images with similar lesion areas. Thus, it is necessary to combine the global features and local ones to make the features more discriminative. On the other hand, training CNN models based on current existing strategies have a high time complexity, and is hard to get the discriminative features mentioned above. In addition, the visual retrieval method of fine‐grained medical images still has the problem of insufficient sample data with accurate annotation information.
Methods
To address above challenges, we introduced a novel fine‐grained medical images retrieval method. First, a centralized contrastive loss (CCLoss) is proposed as our metric learning loss function. Parameters are updated by using the center point, which not only improves the distinguishing performance of features, but also effectively reduces the time complexity of the algorithm. In addition, a weakly supervised progressive feature extraction method is proposed to gradually extract the combined features. And the attention mechanism module is applied to screen the target information after the initial positioning for fine refinement, so as to separate the features with a high degree of discrimination. The retrieval of 14 different chest diseases is evaluated on the chest x‐ray datasets.
Results
Compared with the existing research methods, the proposed method shows a better retrieval result for Recall@8 by 2.26%∼4.6%$\%{\sim }4.6\%$ and achieves a very efficient training speed which is 100 times faster than the pair‐wise loss‐based training strategy. We also assessed the effects of Recall@k (k = 2, 4, 6, 8) for progressive features extracted from different steps to obtain a model with the best retrieval performance.
Conclusions
The proposed model is capable of learning discriminative representations from chest x‐ray datasets, and it achieves better performance compared with other state‐of‐the‐art methods. Therefore, the developed model would be useful in the diagnosis of common thorax disease or unknown chest disease.