Automatic Vertebra Labeling in Large-Scale 3D CT Using Deep Image-to-Image Network with Message Passing and Sparsity Regularization

Yang, Dong; Xiong, Tao; Xu, Daguang; Huang, Qiangui; Liu, David; Zhou, S. Kevin; Xu, Zhoubing; Park, JinHyeong; Chen, Mingqing; Tran, Trac D.; Chin, Sang; Metaxas, Dimitris N.; Comanicìu, Dorin

doi:10.1007/978-3-319-59050-9_50

Cited by 86 publications

(59 citation statements)

References 14 publications

(30 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…It is considered challenging due to the variety of pathological cases, arbitrary of field-of-view and the existence of artificial implants. For all the experiments, we use the official split for training and testing as did by other stateof-the-art methods [19], [20], [24]. In total, there are 302 CT scans in this dataset.…”

Section: Methodsmentioning

confidence: 99%

“…However, as denoted in [23] and also demonstrated in this work, 2D CNNs do not work well in detection problems as they cannot capture the 3D spatial information that is critical to the detection of the target object. More recently, Dong et al [24] proposed a 3D U-Net [25] like architecture to target the vertebrae localization problem in an image-to-image fashion. However, the proposed architecture can not fully address the long-term contextual information in spinal images.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Joint Vertebrae Identification and Localization in Spinal CT Images by Combining Short- and Long-Range Contextual Information

Liao¹,

Mesfin²,

Luo³

2018

IEEE Trans. Med. Imaging

View full text Add to dashboard Cite

Automatic vertebrae identification and localization from arbitrary computed tomography (CT) images is challenging. Vertebrae usually share similar morphological appearance. Because of pathology and the arbitrary field-of-view of CT scans, one can hardly rely on the existence of some anchor vertebrae or parametric methods to model the appearance and shape. To solve the problem, we argue that: 1) one should make use of the short-range contextual information, such as the presence of some nearby organs (if any), to roughly estimate the target vertebrae; and 2) due to the unique anatomic structure of the spine column, vertebrae have fixed sequential order, which provides the important long-range contextual information to further calibrate the results. We propose a robust and efficient vertebrae identification and localization system that can inherently learn to incorporate both the short- and long-range contextual information in a supervised manner. To this end, we develop a multi-task 3-D fully convolutional neural network to effectively extract the short-range contextual information around the target vertebrae. For the long-range contextual information, we propose a multi-task bidirectional recurrent neural network to encode the spatial and contextual information among the vertebrae of the visible spine column. We demonstrate the effectiveness of the proposed approach on a challenging data set, and the experimental results show that our approach outperforms the state-of-the-art methods by a significant margin.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Joint Vertebrae Identification and Localization in Spinal CT Images by Combining Short- and Long-Range Contextual Information

Liao¹,

Mesfin²,

Luo³

2018

IEEE Trans. Med. Imaging

View full text Add to dashboard Cite

show abstract

“…Performance comparison of our approach (setting T = 0, for a fair comparison) with Glocker et al[3], Chen et al[1], & Yang et al[8]. DI2IN refers to stand-alone FCN, while DI2IN* includes use of message passing and shape dictionary.…”

mentioning

confidence: 99%

Btrfly Net: Vertebrae Labelling with Energy-Based Adversarial Learning of Local Spine Prior

Sekuboyina

Rempfler

Kukačka

et al. 2018

Medical Image Computing and Computer Assisted Intervention – MICCAI 2018

View full text Add to dashboard Cite

Robust localisation and identification of vertebrae is essential for automated spine analysis. The contribution of this work to the task is two-fold: (1) Inspired by the human expert, we hypothesise that a sagittal and coronal reformation of the spine contain sufficient information for labelling the vertebrae. Thereby, we propose a butterfly-shaped network architecture (termed Btrfly Net) that efficiently combines the information across reformations. (2) Underpinning the Btrfly net, we present an energy-based adversarial training regime that encodes local spine structure as an anatomical prior into the network, thereby enabling it to achieve state-of-art performance in all standard metrics on a benchmark dataset of 302 scans without any post-processing during inference.

show abstract

“…Music generation C-RNN-GAN [83], SeqGAN [141], ORGAN [41] Text generation RankGAN [73] Speech conversion VAW-GAN [48] Semi-supervised learning SSL-GAN [104], CatGAN [115], Triple-GAN [67] Others Domain adaptation DANN [2], CyCADA [47] Unsupervised pixel-level domain adaptation [12] Continual learning Deep generative replay [110] Medical image segmentation DI2IN [136], SCAN [16], SegAN [134] Steganography Steganography GAN [124], Secure steganography GAN [109] is more likely to be real. G and D compete with each other to achieve their individual goals, thus generating the term adversarial.…”

Section: Dcgan [100] Hierarchymentioning

confidence: 99%

“…This structure leads a segmentor to learn the features of the ground-truth segmentation adversarially similar to the GAN approach. There are also other medical image segmentation algorithms such as the deep image-to-image network (DI2IN) [136] and structure correcting adversarial network (SCAN) [16]. DI2IN conducts liver segmentation of 3D CT images through adversarial learning.…”

Section: Medical Image Segmentationmentioning

confidence: 99%

How Generative Adversarial Networks and Their Variants Work

et al. 2019

View full text Add to dashboard Cite

Generative Adversarial Networks (GAN) have received wide attention in the machine learning field for their potential to learn high-dimensional, complex real data distribution. Specifically, they do not rely on any assumptions about the distribution and can generate real-like samples from latent space in a simple manner. This powerful property leads GAN to be applied to various applications such as image synthesis, image attribute editing, image translation, domain adaptation and other academic fields. In this paper, we aim to discuss the details of GAN for those readers who are familiar with, but do not comprehend GAN deeply or who wish to view GAN from various perspectives. In addition, we explain how GAN operates and the fundamental meaning of various objective functions that have been suggested recently. We then focus on how the GAN can be combined with an autoencoder framework. Finally, we enumerate the GAN variants that are applied to various tasks and other fields for those who are interested in exploiting GAN for their research.

show abstract

Automatic Vertebra Labeling in Large-Scale 3D CT Using Deep Image-to-Image Network with Message Passing and Sparsity Regularization

Cited by 86 publications

References 14 publications

Joint Vertebrae Identification and Localization in Spinal CT Images by Combining Short- and Long-Range Contextual Information

Joint Vertebrae Identification and Localization in Spinal CT Images by Combining Short- and Long-Range Contextual Information

Btrfly Net: Vertebrae Labelling with Energy-Based Adversarial Learning of Local Spine Prior

How Generative Adversarial Networks and Their Variants Work

Contact Info

Product

Resources

About