2022
DOI: 10.1016/j.jneumeth.2022.109579
|View full text |Cite
|
Sign up to set email alerts
|

Quality assessment of anatomical MRI images from generative adversarial networks: Human assessment and image quality metrics

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(12 citation statements)
references
References 60 publications
0
12
0
Order By: Relevance
“…To the best of our knowledge, there is no single study that addresses what criteria for each image quality metric (e.g., SSIM, PSNR, inception score, and frechet inception distance) in the GAN-based augmentation models are sufficient enough to fool the clinicians into whether the medical images are taken experimentally or generated synthetically. However, some recent studies have benefited the human assessment as a gold standard to compare the synthesized medical samples with original medical images [ 12 , 60 , 61 , 62 ]. Thus, a potential perspective work would be conducted a visual assessment by some blinded radiologists to identify real 18F-FDG PET images versus augmented ones and rate them to find out what range for these quantitative metrics is sufficient enough as a diagnostic criterion.…”
Section: Resultsmentioning
confidence: 99%
“…To the best of our knowledge, there is no single study that addresses what criteria for each image quality metric (e.g., SSIM, PSNR, inception score, and frechet inception distance) in the GAN-based augmentation models are sufficient enough to fool the clinicians into whether the medical images are taken experimentally or generated synthetically. However, some recent studies have benefited the human assessment as a gold standard to compare the synthesized medical samples with original medical images [ 12 , 60 , 61 , 62 ]. Thus, a potential perspective work would be conducted a visual assessment by some blinded radiologists to identify real 18F-FDG PET images versus augmented ones and rate them to find out what range for these quantitative metrics is sufficient enough as a diagnostic criterion.…”
Section: Resultsmentioning
confidence: 99%
“…To further evaluate the difference among datasets from a quantitative perspective, we computed the Fréchet inception distance (FID) (Heusel et al, 2017 ) between each pair of datasets. FID is widely used in the GAN literature and is a popular metric for measuring the feature distance between two distributions, which also shows sensitivity to image quality and good correspondence with human perception (Treder et al, 2022 ). We randomly selected 200 samples of each dataset, took a representative slice (the central slice in the sagittal direction), and computed FID between the datasets.…”
Section: Resultsmentioning
confidence: 99%
“…As an alternative to GANs, other methods using variational autoencoders (VAEs) (Kingma & Welling, 2013) have also been devised, such as Tudosiu et al (2020). However, it has been reported that GANs tend to produce clearer images than VAEs in diffusion-weighted and T1-weighted images (Treder et al, 2022). More recently, latent diffusion models have been proposed for generating synthetic images (Pinaya et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…Naturally, it is impractical to find such an image, because generative models generate images that have a multivariate statistical relationship with a set of real images, but not twins of real images. Therefore, non-reference methods, such as the naturalness image quality evaluator (NIQE), began to be used with a combination of other methods [ 36 ]. The statistical distribution of the image is evaluated not by that of another base image but by calculating the deviations from the statistical regularities of the image itself.…”
Section: Related Workmentioning
confidence: 99%
“…Because of this, it cannot evaluate the similarity between the real set and the synthetic set, it can only evaluate the quality of the image. However, in the work [ 36 ], it was noted that the use of the NIQE method in the earlier epochs of GAN, gives good results. As in the initial epochs of the training, the images will be of poor quality and slightly noisy.…”
Section: Related Workmentioning
confidence: 99%