Purpose
Deep learning (DL)‐based super‐resolution (SR) reconstruction for magnetic resonance imaging (MRI) has recently been receiving attention due to the significant improvement in spatial resolution compared to conventional SR techniques. Challenges hindering the widespread implementation of these approaches remain, however. Low‐resolution (LR) MRIs captured in the clinic exhibit complex tissue structures obfuscated by noise that are difficult for a simple DL framework to handle. Moreover, training a robust network for a SR task requires abundant, perfectly matched pairs of LR and high‐resolution (HR) images that are often unavailable or difficult to collect. The purpose of this study is to develop a novel SR technique for MRI based on the concept of cascaded DL that allows for the reconstruction of high‐quality SR images in the presence of insufficient training data, an unknown translation model, and noise.
Methods
The proposed framework, based on the concept named cascaded deep learning, consists of three components: (a) a denoising autoencoder (DAE) trained using clinical LR noisy MRI scans that have been processed with a nonlocal means filter that generates denoised LR data; (b) a down‐sampling network (DSN) trained with a small amount of paired LR/HR data from volunteers that allows for the generation of perfectly paired LR/HR data for the training of a generative model; and (c) the proposed SR generative model (p‐SRG) trained with data generated by the DSN that maps from LR inputs to HR outputs. After training, LR clinical images may be fed through the DAE and p‐SRG to yield SR reconstructions of the LR input. The application of this framework was explored in two settings: 3D breath‐hold MRI axial SR reconstruction from LR axial scans (<3 sec/vol) and in the enhancement of the spatial resolution of LR 4D‐MRI acquisitions (0.5 sec/vol).
Results
The DSN produces LR scans from HR inputs with a higher fidelity to true, LR clinical scans compared to conventional k‐space down‐sampling methods based on the metrics of root mean square error (RMSE) and structural similarity index (SSIM). Furthermore, HR outputs generated by the p‐SRG exhibit improved scores in the peak signal‐to‐noise ratio, normalized RMSE, SSIM, and in the blind/reference‐less image spatial quality evaluator assessment compared to conventional approaches to MRI SR.
Conclusions
The robust, SR reconstruction method for MRI based on the novel cascaded deep learning framework is an end‐to‐end method for producing detail‐preserving SR reconstructions from noisy, LR clinical MRI scans. Fourfold enhancements in spatial resolution facilitate target delineation and motion management during radiation therapy, enabling precise MRI‐guided radiation therapy with 3D LR breath‐hold MRI and 4D‐MRI in a clinically feasible time frame.