In this study cancer screening likelihood method was used to analyze the CT scan group in the National Lung Screening Trial (NLST) data. Three key parameters: screening sensitivity, transition probability density from disease free to preclinical state, and sojourn time in the preclinical state, were estimated using Bayesian approach and Markov Chain Monte Carlo simulations. The sensitivity for lung cancer screening using CT scan is high; it does not depend on a patient's age, and is slightly higher in females than in males. The transition probability from the disease-free to the preclinical state has a peak around age 70 for both genders, which agrees with the fact that the highest lung cancer incidence rate appears between age 65 and 74. The posterior mean sojourn time is around 1.5 years for all groups, and that explains why screening only have a short time interval to catch lung cancer. Accurate estimation of the three key parameters is critical for other estimations such as lead time and over-diagnosis, because these quantities are functions of the three key parameters.
Journal of Biometrics & Biostatisticsandwhere t represents age and x is the sojourn time in the preclinical state S p . We associate the sensitivity β with age t by a logistic link, m is the average age at entry in the whole study group, in this data, m=61.4 years. If b_ 1 >0, then β(t) will be a monotone increasing function of age t. The lognormal distribution was chosen for w(t) with an upper limit of 30%. According to the NIH SEER database, the lifetime risk of lung cancer for the general population is about 7% for both genders [2]. Since participants in the NLST were heavy smokers, the risk would be higher than that, besides the fact that not all people in the preclinical state will progress into clinical cancer. This research proposes 30% as a reasonable upper limit for w(t). A more detailed description of the parametric models can be found in Wu et al. [5,6]. We choose a different sojourn time distribution than Wu et al. [5], where the previous research used log logistic, and we use Weibull distribution here, both share the same property of mathematical simplicity, and both are stable with 2 parameters. However, Weibull is more flexible in that the n-th moments always exist.
ResultsThe six unknown parameters θ=(b 0 , b 1 , μ, σ 2 , λ, α) were estimated based on the NLST data CT arm. We split the data into three groups: male, female and overall. Theoretically, the parameters have a domain of either (-∞,+∞) or (0,+∞). The practical meaning of these parameters will limit them to a finite range. As was described in [5,7], the range for each parameter can be identified as: 0