Epidemiological models can provide the dynamic evolution of a pandemic but they are based on many assumptions and parameters that have to be adjusted over the time when the pandemic lasts. However, often the available data are not sufficient to identify the model parameters and hence infer the unobserved dynamics.
Here, we develop a general framework for building a trustworthy data-driven epidemiological model, consisting of a workflow that integrates data acquisition and event timeline, model development, identifiability analysis, sensitivity analysis, model calibration, model robustness analysis, and forecasting with uncertainties in different scenarios.
In particular, we apply this framework to propose a modified susceptible-exposed-infectious-recovered (SEIR) model, including new compartments and model vaccination in order to forecast the transmission dynamics of COVID-19 in New York City (NYC).
We find that we can uniquely estimate the model parameters and accurately predict the daily new infection cases, hospitalizations, and deaths, in agreement with the available data from NYC's government's website. In addition, we employ the calibrated data-driven model to study the effects of vaccination and timing of reopening indoor dining in NYC.