of combined pulmonary fibrosis and emphysema and idiopathic pulmonary fibrosis: over a 3-year follow-up. Tuberc Respir Dis (Seoul) We read with interest the article by Domsic et al (1), describing a study in which the authors aimed to build a predictive score for 5-year mortality in diffuse cutaneous systemic sclerosis (dcSSc). We would like to take the opportunity, using their article as an example, to discuss what could be considered good practice when developing predictive scores and what could probably improve the results of the analysis. We also would like to address the potential use of their score as an end point in clinical trials.In their multivariable analyses, Dr. Domsic and colleagues appropriately retained a limited number of predictors as allowed by the sample size of the study, i.e., 6 predictors for 110 patients who were deceased after 5 years versus 278 patients who were still living. These numbers are consistent with some previously published guidance on limiting model overfitting (2,3) when unpenalized likelihood is used to estimate the weights of the predictors. Moreover, despite the fact that the authors did not perform an internal validation to correct their findings from optimism (using, for example, bootstrap resampling techniques), they conducted a proper external validation by applying the constructed score to an independent data set. This is, in our view, a major asset of their study. This type of initiative remains, unfortunately, rare in published studies.In their analysis, Domsic et al obtained an area under the curve (AUC) of 69% in the independent validation data set. There are several methodologic approaches that could lead to an improvement of the model performance.First, the predictive performance of the model could gain accuracy by use of alternative predictor selection methods. Indeed, stepwise selection has been widely criticized over the past years because of its propensity to retain uninformative predictors in the model when the number of candidates is large, and to provide inflated estimates (biased high) of their coefficient (3) (the "beta weights" in the report by Domsic et al). Prefiltering the predictors according to their univariate significance, based on principles very similar to the first stage of forward variable selection, has also been discouraged, because this procedure discards predictors that could become important only after other specific ones have been included (4). As an alternative, the LASSO (least absolute shrinkage and selection operator) method described by Tibshirani (5) has been widely used in recent years to select small subsets of predictors, since it does not suffer from most of the drawbacks of stepwise selection.Also, although this is not an uncommon practice in medical publications discussing development of clinical scores, rounding the beta coefficients to an integer could lead to loss of precision of the predictive score and thus to a decreased AUC. The argument for simplifying a predictive score to render it easier to use in clinical pr...