In the midst of an outbreak or sustained epidemic, reliable prediction of transmission risks and patterns of spread is critical to inform public health programs. Projections of growth or decline among specific risk groups can aid in optimizing interventions, particularly when resources are limited. Phylogenetic trees have been widely used in the detection of transmission chains and high-risk populations. Moreover, tree topology and the incorporation of population parameters (phylodynamics) can be useful to reconstruct the evolutionary dynamics of an epidemic across space and time among individuals. We now demonstrate the utility of phylogenetic trees for infection forecasting in addition to backtracking, developing a phylogeny-based deep learning system, called DeepDynaForecast. Our approach leverages a primal-dual graph learning structure with shortcut multi-layer aggregation, and it is suited for the early identification and prediction of transmission dynamics in emerging high-risk groups. We demonstrate the accuracy of DeepDynaForecast using simulated outbreak data and the utility of the learned model using empirical, large-scale data from the human immunodeficiency virus epidemic in Florida between 2012 and 2020. Our framework is available as open-source software (MIT license) at: https://github.com/lab-smile/DeepDynaForcast.
In the midst of an outbreak, identifying groups of individuals at risk of transmitting the pathogen under investigation is critical to public health efforts. Several approaches use the evolutionary information in pathogen genomic data derived from infected individuals to distinguish these groups from the background population, which is composed primarily of randomly sampled individuals with undetermined epidemiological linkage. These methods are, however, limited in their ability to characterize the dynamics of these groups, or clusters of transmission. Dynamic transmission patterns within these clusters, whether they result from changes at the level of the virus (e.g., infectivity) or the host (e.g., vaccination implementation), are critical in strategizing public health interventions, particularly when resources are limited. Phylogenetic trees are widely used in the detection of transmission clusters, and the topological shape of the branches within them can also be a useful source of information about the dynamics of the represented population. We evaluate the limitations of existing tree shape statistics when dealing with smaller subtrees containing transmission clusters and offer instead a phylogeny-based deep learning system (DeepDynaTree) for the classification of transmission clusters. Comprehensive experiments carried out on a variety of simulated epidemic growth models indicate that this graph deep learning approach is effective in predicting cluster dynamics (balanced accuracy of 0.826 vs. 0.533 and Brier score of 0.234 vs. 0.466 in an independent test set). Our deployment model in DeepDynaTree incorporates a primal-dual graph neural network principle using output from phylogeny-based cluster identification tools (available from https://github.com/salemilab/DeepDynaTree).
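The two metrics reported above can be illustrated with a minimal sketch. Balanced accuracy is taken here as the mean of per-class recall, and the Brier score is computed in its multi-class form (squared error between predicted probabilities and the one-hot truth, summed over classes and averaged over samples). The abstract does not state which Brier variant was used, so that choice is an assumption, as are the class names and the toy labels and probabilities, which are invented for illustration and are not data from the study.

```python
from collections import defaultdict

def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so each class counts equally regardless of its size."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)

def brier_score(y_true, probs, classes):
    """Multi-class Brier score (assumed variant): lower is better, 0 is perfect."""
    total = 0.0
    for t, p in zip(y_true, probs):
        # Squared distance between the predicted distribution and the one-hot truth.
        total += sum((p[k] - (1.0 if c == t else 0.0)) ** 2
                     for k, c in enumerate(classes))
    return total / len(y_true)

# Hypothetical cluster-dynamics classes and toy predictions (not from the paper).
classes = ["static", "growth", "decay"]
y_true = ["growth", "growth", "static", "decay"]
y_pred = ["growth", "static", "static", "decay"]
probs = [
    [0.1, 0.8, 0.1],
    [0.6, 0.3, 0.1],
    [0.7, 0.2, 0.1],
    [0.2, 0.2, 0.6],
]

bal_acc = balanced_accuracy(y_true, y_pred)
brier = brier_score(y_true, probs, classes)
print(f"balanced accuracy = {bal_acc:.3f}, Brier score = {brier:.3f}")
```

Balanced accuracy is a natural choice when cluster classes are imbalanced, because a classifier that always predicts the majority class scores no better than chance. The Brier score complements it by penalizing overconfident wrong probabilities rather than only wrong labels.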