2023
DOI: 10.1007/s11042-023-14940-x
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal medical tensor fusion network-based DL framework for abnormality prediction from the radiology CXRs and clinical text reports

Abstract: Pulmonary disease is a commonly occurring abnormality throughout this world. The pulmonary diseases include Tuberculosis, Pneumothorax, Cardiomegaly, Pulmonary atelectasis, Pneumonia, etc. A timely prognosis of pulmonary disease is essential. Increasing progress in Deep Learning (DL) techniques has significantly impacted and contributed to the medical domain, specifically in leveraging medical imaging for analysis, prognosis, and therapeutic decisions for clinicians. Many contemporary DL strategies for radiolo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 64 publications
0
4
0
Order By: Relevance
“…The suggested multichannel dilation convolution layer provides more comprehensive imaging data by generating a larger receptive field, while keeping the network parameters constant, in contrast to the traditional convolutional layer. Moreover, to ensure an even distribution of computational workload across each layer, the Depthwise Separable convolution network is utilized instead of the conventional convolution network [23]. The UM-VES framework is used to extract visual features from both the frontal and lateral CXR images independently, and the resulting features are combined by concatenation.…”
Section: Unimodal Medical Visual Encoding Subnetwork (Um-ves)mentioning
confidence: 99%
See 2 more Smart Citations
“…The suggested multichannel dilation convolution layer provides more comprehensive imaging data by generating a larger receptive field, while keeping the network parameters constant, in contrast to the traditional convolutional layer. Moreover, to ensure an even distribution of computational workload across each layer, the Depthwise Separable convolution network is utilized instead of the conventional convolution network [23]. The UM-VES framework is used to extract visual features from both the frontal and lateral CXR images independently, and the resulting features are combined by concatenation.…”
Section: Unimodal Medical Visual Encoding Subnetwork (Um-ves)mentioning
confidence: 99%
“…The resulting features are then concatenated and passed through 13 depthwise separable layers to learn and extract additional features. For a more comprehensive understanding of the model, readers can refer to our previous paper [23], where we provide a detailed overview and description of the UM-VES model's architecture and components.…”
Section: Unimodal Medical Visual Encoding Subnetwork (Um-ves)mentioning
confidence: 99%
See 1 more Smart Citation
“…The unstructured nature of radiology free-text reports with complex vocabularies makes it difficult for ML/DL models to extract features from the raw text. NLP plays a key role in extracting structured information from clinical text (Pons et al, 2016;Shetty et al, 2023).…”
Section: Introductionmentioning
confidence: 99%