A review on multimodal machine learning in medical diagnostics

Yan, Keyue; Li, Tengyue; Marques, João Alexandre Lôbo; Gao, Juntao; Fong, Simon

doi:10.3934/mbe.2023382

Cited by 9 publications

(3 citation statements)

References 42 publications

“…The self-attention mechanism combined a deep learning encoder-decoder for the segmentation task [25]. The need to eliminate noise and distortion in data stream associated with electrocardiography (ECG) has been addressed using multimodal deep learning method which combines other data streams for improved diagnosis [30]. The fusion of data streams from several 3D neuroimaging into a pattern representing an informative latent embedding has been investigated.…”

Section: Related Work (mentioning)

confidence: 99%

“…Furthermore, the increasing nature of multimodality in biomedical data and the constrained information represented in a single modality are motivations for obtaining sufficient information for disease diagnosis [29]. With this, the unimodal learning is fast becoming obsolete so that the multimodal represents state-of-the-art owning to its capability to improve the robustness of models with the diversity of data [30], and this has been widely applied to speech recognition, image processing, sentiment analysis and forensic applications. The multimodal approach has the advantage of uniformly analyzing heterogeneous features and fuses them into a common representational space.…”

Section: Introduction (mentioning)

confidence: 99%

A twin convolutional neural network with hybrid binary optimizer for multimodal breast cancer digital image classification

Oyelade, Irunokhai, Wang

2024, Sci Rep

Deep learning techniques have been widely applied to unimodal medical image analysis, with significant classification accuracy observed. However, real-world diagnosis of some chronic diseases such as breast cancer often requires multimodal data streams with different modalities of visual and textual content. Mammography, magnetic resonance imaging (MRI) and image-guided breast biopsy represent a few of the multimodal visual streams considered by physicians in isolating cases of breast cancer. Unfortunately, most studies applying deep learning techniques to classification problems in digital breast images have narrowed their scope to unimodal samples. This is understandable given the challenging nature of multimodal image abnormality classification, where the high-dimensional heterogeneous features learned need to be projected into a common representation space. This paper presents a novel deep learning approach using a dual/twin convolutional neural network (TwinCNN) framework to address the challenge of breast cancer image classification from multiple modalities. First, modality-based feature learning was achieved by extracting both low- and high-level features using the networks embedded in TwinCNN. Second, to address the notorious problem of high dimensionality associated with the extracted features, a binary optimization method is adapted to eliminate non-discriminant features in the search space. Furthermore, a novel method for feature fusion is applied to computationally leverage the ground-truth and predicted labels for each sample to enable multimodality classification. To evaluate the proposed method, digital mammography images and digital histopathology breast biopsy samples from the benchmark datasets MIAS and BreakHis, respectively, were used.
Experimental results showed that the classification accuracy and area under the curve (AUC) for the single modalities were 0.755 and 0.861871 for histology, and 0.791 and 0.638 for mammography. Furthermore, the study investigated classification accuracy resulting from the fused feature method, and the results were 0.977, 0.913, and 0.667 for histology, mammography, and multimodality respectively. The findings from the study confirmed that multimodal image classification based on a combination of image features and predicted labels improves performance. In addition, the study shows that feature dimensionality reduction based on a binary optimizer supports the elimination of non-discriminant features capable of bottlenecking the classifier.

“…Ongoing research into computer vision algorithms, utilizing convolutional neural networks for the analysis of medical images, holds the promise of more accurate diagnoses (15, 16). Future developments may even witness integration of sophisticated multi-modal algorithms, combining diverse data sources for highly precise predictions of specific medical conditions (17).…”

Section: AI Throughout the Pediatric Patient Journey (mentioning)

confidence: 99%