Loss Weightings for Improving Imbalanced Brain Structure Segmentation Using Fully Convolutional Networks

Sugino, Takaaki; Kawase, Toshihiro; Onogi, Shinya; Kin, Taichi; Saito, Nobuhito; Nakajima, Yoshikazu

doi:10.3390/healthcare9080938

Cited by 32 publications

(14 citation statements)

References 32 publications

(58 reference statements)


“…Our intuition in this regard is that this is due, mainly, to the class imbalance problem and lack of enough data to train the model for those structures specifically. For this reason, it is important in the future to explore other methods that allow addressing this problem, for example, improving the calculation of the weights of the classes used in the loss functions similar to what is performed in [ 74 ], or using additional data augmentation techniques to increase the samples of classes with less information. Another factor that we considered in the analysis is the fact that deep learning methods based on transformers lack the inductive biases inherent in CNNs requiring large amounts of data to be able to generalize well [ 77 ], so their usage in small-size medical datasets remains difficult without any internal modification in their self-attention module.…”

Section: Discussion and Future Work (mentioning)

confidence: 99%


Deep 3D Neural Network for Brain Structures Segmentation Using Self-Attention Modules in MRI Images

Laiton-Bonadiez

Sánchez-Torres

Bedoya

2022

Sensors


In recent years, the use of deep learning-based models for developing advanced healthcare systems has been growing due to the results they can achieve. However, the majority of the proposed deep learning-models largely use convolutional and pooling operations, causing a loss in valuable data and focusing on local information. In this paper, we propose a deep learning-based approach that uses global and local features which are of importance in the medical image segmentation process. In order to train the architecture, we used extracted three-dimensional (3D) blocks from the full magnetic resonance image resolution, which were sent through a set of successive convolutional neural network (CNN) layers free of pooling operations to extract local information. Later, we sent the resulting feature maps to successive layers of self-attention modules to obtain the global context, whose output was later dispatched to the decoder pipeline composed mostly of upsampling layers. The model was trained using the Mindboggle-101 dataset. The experimental results showed that the self-attention modules allow segmentation with a higher Mean Dice Score of 0.90 ± 0.036 compared with other UNet-based approaches. The average segmentation time was approximately 0.038 s per brain structure. The proposed model allows tackling the brain structure segmentation task properly. Exploiting the global context that the self-attention modules incorporate allows for more precise and faster segmentation. We segmented 37 brain structures and, to the best of our knowledge, it is the largest number of structures under a 3D approach using attention mechanisms.


“…Even the size difference between the structures and the background is usually significant. Therefore, multiple loss functions and weighting strategies for loss functions were proposed for improving imbalanced brain structure segmentation [ 74 ]. In the proposed approach, we used a combination of Dice Loss [ 75 ] and Focal Loss [ 76 ].…”

Section: Methods (mentioning)

confidence: 99%

Deep 3D Neural Network for Brain Structures Segmentation Using Self-Attention Modules in MRI Images

Laiton-Bonadiez

Sánchez-Torres

Bedoya

2022

Sensors
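The Methods excerpt above mentions combining Dice Loss and Focal Loss. The following is a minimal, framework-free sketch of such a combination for the binary case, not the citing paper's implementation. The function names, the equal weights `w_dice` and `w_focal`, and the choice `gamma=2.0` are assumptions made for illustration.

```python
import math

def dice_loss(probs, targets, eps=1e-6):
    """Soft Dice loss over flat lists of voxel probabilities and 0/1 labels."""
    intersection = sum(p * t for p, t in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + eps) / (denom + eps)

def focal_loss(probs, targets, gamma=2.0, eps=1e-7):
    """Binary focal loss without alpha weighting, averaged over voxels."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)      # clip to avoid log(0)
        p_t = p if t == 1 else 1.0 - p       # probability of the true class
        total += -((1.0 - p_t) ** gamma) * math.log(p_t)
    return total / len(probs)

def dice_focal_loss(probs, targets, w_dice=1.0, w_focal=1.0):
    """Weighted sum of the two losses; the weights are hypothetical."""
    return w_dice * dice_loss(probs, targets) + w_focal * focal_loss(probs, targets)

# One foreground voxel among four: a confident correct prediction
# versus a confident wrong one.
targets = [1, 0, 0, 0]
good = dice_focal_loss([1.0, 0.0, 0.0, 0.0], targets)
bad = dice_focal_loss([0.1, 0.9, 0.9, 0.9], targets)
```

The Dice term depends on overlap rather than on the voxel count, and the focal term down-weights voxels that are already classified well, which is why the pair is often used when the foreground is small.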


“…The former could be addressed by scanning with smaller voxel dimensions [3] or by resampling scans into smaller voxel dimensions during the preprocessing steps. The latter could be addressed by implementing a class balancing scheme according to the pixel-wise frequency of each class in the dataset [52]. Since the goal of the current paper was to assess the value of synth-DECT scans, we did not implement class balancing schemes to mitigate the errors found at the margins of the scans.…”

Section: Discussion (mentioning)

confidence: 99%

Deep Learning and Domain-Specific Knowledge to Segment the Liver from Synthetic Dual Energy CT Iodine Scans

et al. 2022


We map single energy CT (SECT) scans to synthetic dual-energy CT (synth-DECT) material density iodine (MDI) scans using deep learning (DL) and demonstrate their value for liver segmentation. A 2D pix2pix (P2P) network was trained on 100 abdominal DECT scans to infer synth-DECT MDI scans from SECT scans. The source and target domain were paired with DECT monochromatic 70 keV and MDI scans. The trained P2P algorithm then transformed 140 public SECT scans to synth-DECT scans. We split 131 scans into 60% train, 20% tune, and 20% held-out test to train four existing liver segmentation frameworks. The remaining nine low-dose SECT scans tested system generalization. Segmentation accuracy was measured with the dice coefficient (DSC). The DSC per slice was computed to identify sources of error. With synth-DECT (and SECT) scans, an average DSC score of 0.93±0.06 (0.89±0.01) and 0.89±0.01 (0.81±0.02) was achieved on the held-out and generalization test sets. Synth-DECT-trained systems required less data to perform as well as SECT-trained systems. Low DSC scores were primarily observed around the scan margin or due to non-liver tissue or distortions within ground-truth annotations. In general, training with synth-DECT scans resulted in improved segmentation performance with less data.
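The Discussion excerpt for this paper mentions a class balancing scheme based on the pixel-wise frequency of each class in the dataset. One common way to realize that idea is inverse-frequency weighting, sketched below. The function name `inverse_frequency_weights` and the normalization by the number of classes are assumptions for illustration; the excerpt does not specify a formula.

```python
from collections import Counter

def inverse_frequency_weights(label_maps):
    """Return a weight per class that is inversely proportional to its pixel count.

    label_maps: iterable of flat lists of integer class labels, one per scan.
    """
    counts = Counter()
    for labels in label_maps:
        counts.update(labels)                # accumulate pixel counts per class
    total = sum(counts.values())
    n_classes = len(counts)
    # A class covering exactly 1/n_classes of all pixels gets weight 1.0.
    return {c: total / (n_classes * n) for c, n in counts.items()}

# Two tiny "scans": class 0 (background) has 7 pixels, class 1 has 1 pixel.
weights = inverse_frequency_weights([[0, 0, 0, 1], [0, 0, 0, 0]])
```

The resulting weights would typically multiply the per-pixel loss terms, so that rare classes contribute more to the total loss.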


“…To reduce memory consumption without impacting performance, we selected a basis of 24 filters of 3 × 3 (24 for the first layer, 48 for the second and so on), as proposed in [23]. The loss function (L) was a combination of binary cross-entropy (L_BCE) [24] and Dice loss (L_DL) [25] which is demonstrated to be well suited for imbalanced structure segmentation [26]. L was defined as:…”

Section: AI Training Framework (mentioning)

confidence: 99%

Deep Learning for the Automatic Quantification of Pleural Plaques in Asbestos-Exposed Subjects

Benlala

Dournes

Menant

et al. 2022

IJERPH


Objective: This study aimed to develop and validate an automated artificial intelligence (AI)-driven quantification of pleural plaques in a population of retired workers previously occupationally exposed to asbestos. Methods: CT scans of former workers previously occupationally exposed to asbestos who participated in the multicenter APEXS (Asbestos PostExposure Survey) study were collected retrospectively between 2010 and 2017 during the second and the third rounds of the survey. A hundred and forty-one participants with pleural plaques identified by expert radiologists at the 2nd and the 3rd CT screenings were included. Maximum Intensity Projection (MIP) with 5 mm thickness was used to reduce the number of CT slices for manual delineation. A Deep Learning AI algorithm using 2D-convolutional neural networks was trained with 8280 images from 138 CT scans of 69 participants for the semantic labeling of Pleural Plaques (PP). In all, 2160 CT images from 36 CT scans of 18 participants were used for AI testing versus ground-truth labels (GT). The clinical validity of the method was evaluated longitudinally in 54 participants with pleural plaques. Results: The concordance correlation coefficient (CCC) between AI-driven and GT was almost perfect (>0.98) for the volume extent of both PP and calcified PP. The 2D pixel similarity overlap of AI versus GT was good (DICE = 0.63) for PP, whether they were calcified or not, and very good (DICE = 0.82) for calcified PP. A longitudinal comparison of the volumetric extent of PP showed a significant increase in PP volumes (p < 0.001) between the 2nd and the 3rd CT screenings with an average delay of 5 years. Conclusions: AI allows a fully automated volumetric quantification of pleural plaques showing volumetric progression of PP over a five-year period. 
The reproducible PP volume evaluation may enable further investigations for the comprehension of the unclear relationships between pleural plaques and both respiratory function and occurrence of thoracic malignancy.
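The AI Training Framework excerpt for this paper describes a loss that combines binary cross-entropy and Dice loss, but it is cut off before the definition of L. The sketch below therefore does not reproduce that definition. It only illustrates one plausible form, a convex combination with a hypothetical mixing weight `lam`; all function names are likewise assumptions.

```python
import math

def bce_loss(probs, targets, eps=1e-7):
    """Binary cross-entropy averaged over pixels."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)      # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(probs)

def soft_dice_loss(probs, targets, eps=1e-6):
    """Soft Dice loss over flat lists of probabilities and 0/1 labels."""
    intersection = sum(p * t for p, t in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + eps) / (denom + eps)

def bce_dice_loss(probs, targets, lam=0.5):
    """Assumed form: lam * BCE + (1 - lam) * Dice; lam is hypothetical."""
    return lam * bce_loss(probs, targets) + (1.0 - lam) * soft_dice_loss(probs, targets)

# An uninformative prediction (0.5 everywhere) on a sparse foreground mask.
probs = [0.5, 0.5, 0.5, 0.5]
targets = [1, 0, 0, 0]
loss = bce_dice_loss(probs, targets)
```

Cross-entropy gives a per-pixel gradient signal, while the Dice term scores overlap with the foreground region as a whole, so the two are often paired for small structures.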
