Objective: 3D reconstruction of lumbar intervertebral foramen (LIVF) has been beneficial in evaluating surgical trajectory. Still, the current methods of reconstructing the 3D LIVF model are mainly based on manual segmentation, which is laborious and time-consuming. This study aims to explore the feasibility of automatically segmenting lumbar spinal structures and increasing the speed and accuracy of 3D lumbar intervertebral foramen (LIVF) reconstruction on magnetic resonance image (MRI) at the L4-5 level.Methods: A total of 100 participants (mean age: 42.2 AE 14.0 years; 52 males and 48 females; mean body mass index, 22.7 AE 3.2 kg/m 2 ), were enrolled in this prospective study between March and July 2020. All participants were scanned on L4-5 level with a 3T MR unit using 3D T2-weighted sampling perfection with applicationoptimized contrast with various flip-angle evolutions (SPACE) sequences. The lumbar spine's vertebra bone structures (VBS) and intervertebral discs (IVD) were manually segmented by skilled surgeons according to their anatomical outlines from MRI. Then all manual segmentation were saved and used for training. An automated segmentation method based on a 3D U-shaped architecture network (3D-UNet) was introduced for the automated segmentation of lumbar spinal structures. A number of quantitative metrics, including dice similarity coefficient (DSC), precision, and recall, were used to evaluate the performance of the automated segmentation method on MRI. Wilcoxon signed-rank test was applied to compare morphometric parameters, including foraminal area, height and width of 3D LIVF models between automatic and manual segmentation. The intra-class correlation coefficient was used to assess the test-retest reliability and inter-observer reliability of multiple measurements for these morphometric parameters of 3D LIVF models.
Results:The automatic segmentation performance of all spinal structures (VBS and IVD) was found to be 0.918 (healthy levels: 0.922; unhealthy levels: 0.916) for the mean DSC, 0.922 (healthy levels: 0.927; unhealthy levels: 0.920) for the mean precision, and 0.917 (healthy levels: 0.918; unhealthy levels: 0.917) for the mean recall in the test dataset. It took approximately 2.5 s to achieve each automated segmentation, far less than the 240 min for manual segmentation. Furthermore, no significant differences were observed in the foraminal area, height and width of the 3D LIVF models between manual and automatic segmentation images (P > 0.05).