Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States

Cramer, Estee Y.; Ray, Evan L.; Lopez, Velma K.; Bracher, Johannes; Brennen, Andrea; Rivadeneira, Alvaro J Castro; Gerding, Aaron; Gneiting, Tilmann; House, Katie; Huang, Yuxin; Jayawardena, Dasuni; Kanji, Abdul Hannan; Khandelwal, Ayush; Le, Khoa; Mühlemann, Anja; Niemi, Jarad; Shah, Apurv; Stark, Ariane; Wang, Yijin; Wattanachit, Nutcha; Zorn, Martha; Gu, Youyang; Jain, Sansiddh; Bannur, Nayana; Deva, Ayush; Kulkarni, Mihir; Merugu, Srujana; Raval, Alpan; Shingi, Siddhant; Tiwari, Avtansh; White, Jerome; Abernethy, Neil F.; Woody, Spencer; Dahan, Maytal; Fox, Spencer J.; Gaither, Kelly; Lachmann, Michael; Meyers, Lauren Ancel; Scott, James G.; Tec, Mauricio; Srivastava, Ajitesh; George, Glover E.; Cegan, Jeffrey C.; Dettwiller, Ian D.; England, William P.; Farthing, Matthew W.; Hunter, Robert H.; Lafferty, Brandon J.; Linkov, Igor; Mayo, Michael L.; Parno, Matthew; Rowland, Michael; Trump, Benjamin D.; Zhang‐James, Yanli; Chen, Samuel; Faraone, Stephen V.; Hess, Jonathan; Morley, Christopher P.; Salekin, Asif; Wang, Dongliang; Corsetti, Sabrina; Baer, Thomas M.; Eisenberg, Marisa C.; Falb, Karl; Huang, Yitao; Martin, Emily T.; McCauley, Ella; Myers, Robert L.; Schwarz, T.; Sheldon, Daniel; Gibson, Graham Casey; Yu, Rose; Gao, Liyao; Ma, Yuanlin; Wu, Dongxia; Yan, Xifeng; Jin, Xiaoyong; Wang, Yuxiang; Chen, YangQuan; Li-hong, Guo; Zhao, Yong; Gu, Quanquan; Chen, Jinghui; Wang, Lingxiao; Xu, Pan; Zhang, Weitong; Zou, Difan; Biegel, Hannah; Lega, J.; McConnell, Steve; Nagraj, VP; Guertin, Stephanie L; Hulme-Lowe, Christopher; Turner, Stephen D.; Shi, Yunfeng; Ban, Xuegang; Walraven, Robert; Hong, Qi‐Jun; Kong, Stanley; Walle, Axel van de; Turtle, James; Ben‐Nun, M.; Riley, Steven; Riley, Pete; Koyluoglu, Ugur; DesRoches, David; Forli, Pedro; Hamory, Bruce H.; Kyriakides, Christina; Leis, Helen; Milliken, John; Moloney, Michael; Morgan, James P.; Nirgudkar, Ninad; Ozcan, Gokce; Piwonka, Noah; Ravi, Matt; Schrader, Chris; Shakhnovich, Elizabeth A.; Siegel, Daniel M.; Spatz, Ryan; Stiefeling, Chris; Wilkinson, Barrie; Wong, Alexander; Cavany, Sean; España, Guido; Moore, Sean M.; Oidtman, Rachel J.; Perkins, T. Alex; Kraus, David; Kraus, Andrea; Gao, Zhifeng; Bian, Jiang; Cao, Wei; Ferres, Juan M. Lavista; Li, Chaozhuo; Liu, Tie-Yan; Xie, Xing; Zhang, Shun; Zheng, Shuai; Vespignani, Alessandro; Chinazzi, Matteo; Davis, Jessica T.; Mu, Kunpeng; Piontti, Ana Pastore y; Xiong, Xinyue; Zheng, Andrew; Baek, Jackie; Farias, Vivek F.; Georgescu, Andreea; Levi, Retsef; Sinha, Deeksha; Wilde, Joshua; Perakis, Georgia; Bennouna, M. Amine; Nze-Ndong, David; Singhvi, Divya; Spantidakis, Ιoannis; Thayaparan, Leann; Tsiourvas, Asterios; Sarker, Arnab; Jadbabaie, Ali; Shah, Devavrat; Penna, Nicolás Della; Celi, Leo Anthony; Sundar, Saketh; Wolfinger, Russ; Osthus, Dave; Castro, Lauren; Fairchild, Geoffrey; Michaud, Isaac; Karlen, D.; Kinsey, Matt; Mullany, Luke C.; Rainwater-Lovett, Kaitlin; Shin, Lauren; Tallaksen, Katharine; Wilson, Shelby; Lee, Elizabeth C.; Dent, Juan; Grantz, Kyra H.; Hill, Alison L.; Kaminsky, Joshua; Kaminsky, Kathryn; Keegan, Lindsay T; Lauer, Stephen A.; Lemaitre, Joseph C.; Lessler, Justin; Meredith, Hannah R.; Perez-Saez, Javier; Shah, Sam; Smith, Claire P; Truelove, Shaun; Wills, Josh; Marshall, Maximilian; Gardner, Lauren; Nixon, Kristen; Burant, John C.; Wang, Lily; Gao, Leilei; Gu, Zhiling; Kim, Myung-Jin; Li, Xinyi; Wang, Guannan; Wang, Yueying; Yu, Shan; Reiner, Robert C.; Barber, Ryan M; Gakidou, Emmanuela; Hay, Simon I.; Lim, Steve; Murray, Chris; Pigott, David M.; Gurung, Heidi; Baccam, Prasith; Stage, Steven A.; Suchoski, Bradley T.; Prakash, B. Aditya; Adhikari, Bijaya; Cui, Jiaming; Rodríguez, Alexander; Tabassum, Anika; Xie, Jiajia; Keskinocak, Pınar; Asplund, John; Baxter, Arden; Oruc, Buse Eylul; Serban, Nicoleta; Arık, Sercan Ö.; Dusenberry, Mike; Epshteyn, Arkady; Kanal, Elli; Le, Long T.; Li, Chun-Liang; Pfister, Tomas; Sava, Dario; Sinha, Rajarishi; Tsai, Thomas C.; Yoder, Nathanael C.; Yoon, Jinsung; Zhang, Leyou; Abbott, Sam; Bosse, Nikos I.; Funk, Sebastian; Hellewell, Joel; Meakin, Sophie; Sherratt, Katharine; Zhou, Mingyuan; Kalantari, Rahi; Yamana, Teresa K.; Pei, Sen; Shaman, Jeffrey; Li, Michael Lingzhi; Bertsimas, Dimitris; Lami, Omar Skali; Soni, Saksham; Bouardi, Hamza Tazi; Ayer, Turgay; Adee, Madeline; Chhatwal, Jagpreet; Dalgıç, Özden O.; Ladd, Mary Ann; Linas, Benjamin P.; Mueller, Peter P.; Xiao, Jade; Wang, Yuanjia; Wang, Qinxia; Xie, Shanghong; Zeng, Donglin; Green, Alden; Bien, Jacob; Brooks, Logan; Hu, Addison J.; Jahja, Maria; McDonald, Daniel J.; Narasimhan, Balasubramanian; Politsch, Collin A.; Rajanala, Samyak; Rumack, Aaron; Simon, Noah; Tibshirani, Ryan J.; Tibshirani, Rob; Ventura, Valérie; Wasserman, Larry; O’Dea, Eamon B.; Drake, John M.; Pagano, Robert R.; Tran, Quoc; Ho, Lam Si Tung; Huynh, Huong; Walker, Jo; Slayton, Rachel B.; Johansson, Michael A.; Biggerstaff, Matthew; Reich, Nicholas G.

doi:10.1073/pnas.2113561119

Cited by 201 publications

(178 citation statements)

References 29 publications

Supporting

Mentioning

154

Contrasting

Order By: Relevance

“…The data for the first study come from the CDC collection of COVID-19 forecasts. In April 2020, and with the growing need to forecast COVID-19 trajectories in the United States, the CDC partnered with a research laboratory at the University of Massachusetts Amherst to create a forecast hub that collects and synthesizes COVID-19 trajectory predictions [ 20 ]. As of May 2021, more than 70 modeling groups from academic institutions, research laboratories, and the private sector had contributed by submitting their simulation-based trajectory predictions for various locations in the United States (typically at the state and national levels, but also some county-level predictions).…”

Section: First Study: Associations Between Model Architecture and Pre...mentioning

confidence: 99%

“…the flexibility of likelihood functions used in estimation). While it is feasible to combine point predictions and prediction intervals using proper scoring rules [ 22 ], we maintain our focus distinctly on drivers of point prediction accuracy and omit a comparison of prediction intervals (e.g., see [ 20 , 21 ] for comparing those intervals). We collected all the state- and national-level forecasts we could find on the project website.…”

Section: First Study: Associations Between Model Architecture and Pre...mentioning

confidence: 99%

“…The performance of a median ensemble (of the 61 models of the CDC hub plus the SEIRb set) is also plotted for comparison. Median ensemble is a promising approach to improve predictions: by incorporating information from all predictions from a group of models, while excluding outliers, it significantly improves predictions over any typical model [ 20 , 31 , 32 ], and thus it is a tough standard to beat. SEIRb outperforms the median ensemble.…”

Section: Second Study: Designing a Model To Assess Prediction-enhanci...mentioning

confidence: 99%

“…We focus on 490,210 point-forecasts for weekly death incidents (across 57 locations); forecast dates over the span of a year (4/13/2020 to 3/29/2021); 20 forecast horizons (1-week-ahead predictions to 20-week-ahead predictions); and 61 models (not every model offers a forecast for every location, forecast date, or horizon). Prior research has used these data to assess the quality of probabilistic forecasts from various models [ 20 , 21 ]. We build on this work by examining the association between the modeling approach and point prediction quality using hand-coded architectural features of forecasting models.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Enhancing long-term forecasting: Learning from COVID-19 models

Rahmandad

Ghaffarzadegan

2022

PLoS Comput Biol

View full text Add to dashboard Cite

While much effort has gone into building predictive models of the COVID-19 pandemic, some have argued that early exponential growth combined with the stochastic nature of epidemics make the long-term prediction of contagion trajectories impossible. We conduct two complementary studies to assess model features supporting better long-term predictions. First, we leverage the diverse models contributing to the CDC repository of COVID-19 USA death projections to identify factors associated with prediction accuracy across different projection horizons. We find that better long-term predictions correlate with: (1) capturing the physics of transmission (instead of using black-box models); (2) projecting human behavioral reactions to an evolving pandemic; and (3) resetting state variables to account for randomness not captured in the model before starting projection. Second, we introduce a very simple model, SEIRb, that incorporates these features, and few other nuances, offers informative predictions for as far as 20-weeks ahead, with accuracy comparable with the best models in the CDC set. Key to the long-term predictive power of multi-wave COVID-19 trajectories is capturing behavioral responses endogenously: balancing feedbacks where the perceived risk of death continuously changes transmission rates through the adoption and relaxation of various Non-Pharmaceutical Interventions (NPIs).

show abstract

Section: First Study: Associations Between Model Architecture and Pre...mentioning

confidence: 99%

Section: First Study: Associations Between Model Architecture and Pre...mentioning

confidence: 99%

Section: Second Study: Designing a Model To Assess Prediction-enhanci...mentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Enhancing long-term forecasting: Learning from COVID-19 models

Rahmandad

Ghaffarzadegan

2022

PLoS Comput Biol

View full text Add to dashboard Cite

show abstract

“…Efforts to date have revealed some of the challenges with predicting the course of the COVID-19 pandemic 1,2 . Critically, individual risk reduction behaviors and policy compliance, which directly impact case growth, are not easily measured.…”

Section: Introductionmentioning

confidence: 99%

An Evaluation of Prospective COVID-19 Modeling: From Data to Science Translation

Nixon

Jindal

Parker

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

SummaryBackgroundInfectious disease modeling can serve as a powerful tool for science-based management of outbreaks, providing situational awareness and decision support for policy makers. Predictive modeling of an emerging disease is challenging due to limited knowledge on its epidemiological characteristics. For COVID-19, the prediction difficulty was further compounded by continuously changing policies, varying behavioral responses, poor availability and quality of crucial datasets, and the variable influence of different factors as the pandemic progresses. Due to these challenges, predictive modeling for COVID-19 has earned a mixed track record.MethodsWe provide a systematic review of prospective, data-driven modeling studies on population-level dynamics of COVID-19 in the US and conduct a quantitative assessment on crucial elements of modeling, with a focus on the aspects of modeling that are critical to make them useful for decision-makers. For each study, we documented the forecasting window, methodology, prediction target, datasets used, geographic resolution, whether they expressed quantitative uncertainty, the type of performance evaluation, and stated limitations. We present statistics for each category and discuss their distribution across the set of studies considered. We also address differences in these model features based on fields of study.FindingsOur initial search yielded 2,420 papers, of which 119 published papers and 17 preprints were included after screening. The most common datasets relied upon for COVID-19 modeling were counts of cases (93%) and deaths (62%), followed by mobility (26%), demographics (25%), hospitalizations (12%), and policy (12%). Our set of papers contained a roughly equal number of short-term (46%) and long-term (60%) predictions (defined as a prediction horizon longer than 4 weeks) and statistical (43%) versus compartmental (47%) methodologies. The target variables used were predominantly cases (89%), deaths (52%), hospitalizations (10%), and Rt (9%). We found that half of the papers in our analysis did not express quantitative uncertainty (50%). Among short-term prediction models, which can be fairly evaluated against truth data, 25% did not conduct any performance evaluation, and most papers were not evaluated over a timespan that includes varying epidemiological dynamics. The main categories of limitations stated by authors were disregarded factors (39%), data quality (28%), unknowable factors (26%), limitations specific to the methods used (22%), data availability (16%), and limited generalizability (8%). 36% of papers did not list any limitations in their discussion or conclusion section.InterpretationPublished COVID-19 models were found to be consistently lacking in some of the most important elements required for usability and translation, namely transparency, expressing uncertainty, performance evaluation, stating limitations, and communicating appropriate interpretations. Adopting the EPIFORGE 2020 guidelines would address these shortcomings and improve the consistency, reproducibility, comparability, and quality of epidemic forecasting reporting. We also discovered that most of the operational models that have been used in real-time to inform decision-making have not yet made it into the published literature, which highlights that the current publication system is not suited to the rapid information-sharing needs of outbreaks. Furthermore, data quality was identified to be one of the most important drivers of model performance, and a consistent limitation noted by the modeling community. The US public health infrastructure was not equipped to provide timely, high-quality COVID-19 data, which is required for effective modeling. Thus, a systematic infrastructure for improved data collection and sharing should be a major area of investment to support future pandemic preparedness.

show abstract