With the development of today’s society, medical technology is becoming more and more important in people’s daily diagnosis and treatment and the number of computed tomography (CT) images and MRI images is also increasing. It is difficult to meet today’s needs for segmentation and recognition of medical images by manpower alone. Therefore, the use of computer technology for automatic segmentation has received extensive attention from researchers. We design a tooth CT image segmentation method combining attention mechanism and ENet. First, dilated convolution is used with the spatial information path, with a small downsampling factor to preserve the resolution of the image. Second, an attention mechanism is added to the segmentation network based on CT image features to improve the accuracy of segmentation. Then, the designed feature fusion module obtains the segmentation result of the tooth CT image. It was verified on tooth CT image dataset published by West China Hospital, and the average intersection ratio and accuracy were used as the metric. The results show that, on the dataset of West China Hospital, Mean Intersection over Union (MIOU) and accuracy are 83.47% and 95.28%, respectively, which are 3.3% and 8.09% higher than the traditional model. Compared with the multiple watershed algorithm, the Chan–Vese segmentation algorithm, and the graph cut segmentation algorithm, our algorithm increases the calculation time by 56.52%, 91.52%, and 62.96%, respectively. It can be seen that our algorithm has obvious advantages in MIOU, accuracy, and calculation time.