Cephalometric analysis is a standard diagnostic tool in orthodontics and craniofacial surgery. Because conventional 2D cephalometry is limited and susceptible to analysis bias, clinical practice demands a more reliable and user-friendly three-dimensional system that covers hard tissue, soft tissue, and the airway. We undertook this study to develop such a system based on CT data and landmarks. The study aims to determine whether the data labeled through our process are of high quality and whether the soft tissue and airway data derived from CT scans are reliable. We enrolled 15 patients (seven males, eight females, 26.47 ± 3.44 years old) diagnosed with either non-syndromic dento–maxillofacial deformities or OSDB to evaluate the intra- and inter-examiner reliability of our system. A total of 126 landmarks were adopted and divided into five sets by region: 28 cranial points, 25 mandibular points, 20 teeth points, 48 soft tissue points, and 6 airway points. All landmarks were labeled by two experienced clinical practitioners, each of whom labeled all the data twice, at least one month apart. In addition, 78 parameters in three sets were calculated: 42 skeletal parameters (23 angular and 19 linear), 27 soft tissue parameters (9 angular and 18 linear), and 9 upper airway parameters (2 linear, 4 areal, and 3 volumetric). The intraclass correlation coefficient (ICC) was used to evaluate the inter-examiner and intra-examiner reliability of landmark coordinate values and measurement parameters. The overwhelming majority of the landmarks showed excellent intra- and inter-examiner reliability. Among skeletal parameters, angular parameters showed better reliability, whereas among soft tissue parameters, linear parameters performed better. The intra- and inter-examiner ICCs of the airway parameters indicated excellent reliability.
In summary, the data labeled through our process are of high quality, and the soft tissue and airway data derived from CT scans are reliable. Landmarks that are not commonly used in clinical practice may require additional attention during labeling, as they are prone to poor reliability. Measurement parameters with values close to 0 tend to have low reliability. We believe this three-dimensional cephalometric system has the potential to reach clinical application.
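As an illustration of the reliability statistic used above, the following is a minimal sketch of how an ICC could be computed for one landmark coordinate or one measurement parameter. The text does not state which ICC form was used, so the two-way random-effects, absolute-agreement, single-measurement form (often written ICC(2,1)) is an assumption here. The function name `icc_2_1` and the example values are hypothetical and are not study data.

```python
import numpy as np


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single measurement.

    ratings: array of shape (n_subjects, n_raters). Each row is one patient,
    each column is one examiner (inter-examiner) or one labeling session
    (intra-examiner). The choice of this ICC form is an assumption.
    """
    x = np.asarray(ratings, dtype=float)
    n, k = x.shape
    grand = x.mean()
    row_means = x.mean(axis=1)  # per-subject means
    col_means = x.mean(axis=0)  # per-rater means

    # Mean squares from a two-way ANOVA without replication.
    ms_rows = k * np.sum((row_means - grand) ** 2) / (n - 1)
    ms_cols = n * np.sum((col_means - grand) ** 2) / (k - 1)
    resid = x - row_means[:, None] - col_means[None, :] + grand
    ms_err = np.sum(resid ** 2) / ((n - 1) * (k - 1))

    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


if __name__ == "__main__":
    # Hypothetical x-coordinates (mm) of one landmark in five patients,
    # labeled by two examiners. Made-up values for illustration only.
    demo = np.array([
        [12.1, 12.3],
        [15.4, 15.2],
        [9.8, 10.1],
        [14.0, 13.9],
        [11.5, 11.6],
    ])
    print(f"ICC(2,1) = {icc_2_1(demo):.3f}")
```

The formula also makes the ICC's dependence on between-patient spread explicit: when the between-subject mean square is small relative to the residual mean square, the ratio falls even if the absolute disagreement between examiners is small.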