Purpose
To automatically detect and isolate areas of low and high stiffness temporal stability in shear wave elastography (SWE) image sequences and define their impact in chronic liver disease (CLD) diagnosis improvement by means of clinical examination study and deep learning algorithm employing convolutional neural networks (CNNs).
Materials and Methods
Two hundred SWE image sequences from 88 healthy individuals (F0 fibrosis stage) and 112 CLD patients (46 with mild fibrosis (F1), 16 with significant fibrosis (F2), 22 with severe fibrosis (F3), and 28 with cirrhosis (F4)) were analyzed to detect temporal stiffness stability between frames. An inverse Red, Green, Blue (RGB) colormap‐to‐stiffness process was performed for each image sequence, followed by a wavelet transform and fuzzy c‐means clustering algorithm. This resulted in a binary mask depicting areas of high and low stiffness temporal stability. The mask was then applied to the first image of the SWE sequence, and the derived, masked SWE image was used to estimate its impact in standard clinical examination and CNN classification. Regarding the impact of the masked SWE image in clinical examination, one measurement by two radiologists was performed in each SWE image and two in the corresponding masked image measuring areas with high and low stiffness temporal stability. Then, stiffness stability parameters, interobserver variability evaluation and diagnostic performance by means of ROC analysis were assessed. The masked and unmasked sets of SWE images were fed into a CNN scheme for comparison.
Results
The clinical impact evaluation study showed that the masked SWE images decreased the interobserver variability of the radiologists’ measurements in the high stiffness temporal stability areas (interclass correlation coefficient (ICC) = 0.92) compared to the corresponding unmasked ones (ICC = 0.76). In terms of diagnostic accuracy, measurements in the high‐stability areas of the masked SWE images (area‐under‐the‐curve (AUC) ranging from 0.800 to 0.851) performed similarly to those in the unmasked SWE images (AUC ranging from 0.805 to 0.893). Regarding the measurements in the low stiffness temporal stability areas of the masked SWE images, results for interobserver variability (ICC = 0.63) and diagnostic accuracy (AUC ranging from 0.622 to 0.791) were poor. Regarding the CNN classification, the masked SWE images showed improved accuracy (ranging from 82.5% to 95.5%) compared to the unmasked ones (ranging from 79.5% to 93.2%) for various CLD stage combinations.
Conclusion
Our detection algorithm excludes unreliable areas in SWE images, reduces interobserver variability, and augments CNN's accuracy scores for many combinations of fibrosis stages.