Objective. To develop the model for early prediction of clinically significant bronchopulmonary dysplasia in extremely premature infants.
Materials and methods. 226 premature infants with gestational age less than 31 weeks, birth weight from 490 to 999 g, age from 0 to 7 days, and respiratory failure requiring ventilatory support (ventilator support) were included into a retrospective study conducted in the Perm Regional Perinatal Center. Machine learning algorithms such as logistic regression, support vector machine, random forest method, and gradient boosting method were used for the prognostic model building. Five variables were used: birth weight, Apgar score in the 5th minute of life, Silverman score, number of days of invasive ventilatory support, median oxygen fraction in the inhaled air measured daily during the first seven days of life.
Results. In the 36th week of postconceptional age 148 out of 182 infants (81.3%) in the study cohort developed bronchopulmonary dysplasia (BPD), among them 15.4% had a mild form, 29.7% a moderate one, and in 36.3% of patient it was severe. Among the four studied prediction algorithms, logistic regression model was chosen as the final model with metrics: AUC=0.840, accuracy 0.818, sensitivity 0.972, specificity 0.666. The practical application of the modeling results was implemented in the form of a probability calculator.
Conclusions. In the early neonatal period of extremely premature infants, a combination of clinical predictors such as birth weight, Apgar score in the 5th minute of life, Silverman score, number of days of invasive ventilatory support, median oxygen fraction in the inhaled air measured during the first seven days of life can be used to predict the development of bronchopulmonary dysplasia. The logistic regression model shows high sensitivity that minimizes the probability of an error of second kind. Thus, its application is useful in the early prediction of bronchopulmonary dysplasia in premature infants.