2021
DOI: 10.1038/s41598-021-83084-5
|View full text |Cite
|
Sign up to set email alerts
|

Use Internet search data to accurately track state level influenza epidemics

Abstract: For epidemics control and prevention, timely insights of potential hot spots are invaluable. Alternative to traditional epidemic surveillance, which often lags behind real time by weeks, big data from the Internet provide important information of the current epidemic trends. Here we present a methodology, ARGOX (Augmented Regression with GOogle data CROSS space), for accurate real-time tracking of state-level influenza epidemics in the United States. ARGOX combines Internet search data at the national, regiona… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
20
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 13 publications
(20 citation statements)
references
References 34 publications
0
20
0
Order By: Relevance
“…We use 23 highly correlated COVID-19 related Google search queries discovered in prior study 44 (in daily frequency) for COVID-19 cases and deaths forecasts, while using ILI related queries (weekly frequency) from previous study 22,24 for %ILI forecasts. We obtain the search queries for national, regional (summation from states) and state level.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…We use 23 highly correlated COVID-19 related Google search queries discovered in prior study 44 (in daily frequency) for COVID-19 cases and deaths forecasts, while using ILI related queries (weekly frequency) from previous study 22,24 for %ILI forecasts. We obtain the search queries for national, regional (summation from states) and state level.…”
Section: Methodsmentioning
confidence: 99%
“…For COVID-19, our raw estimates for state m week τ cases/deaths y τ,m are ŷGT τ;m , ŷreg τ;r m , ŷnat τ , and y τ−1,m , where r m is the region number for state m. Here, we denote GT and reg to be state/regional estimates with internet search information only, and nat to be national estimates (same as prior study 44 ). Similarly, we obtain the raw estimates for state m weekly %ILI p τ,m : pGT τ;m 24 , preg τ;r m 23 , pnat τ 22 , p τ−1,m . In the second step, we fit two models separately using the raw estimates from step 1 as inputs.…”
Section: Forecasting Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The detailed data collection procedure and subsequent data pre-processing (introduced in sections below) are illustrated in the flowchart (Figure S1). In step 1 (green highlighted boxes in Figure S1), to curate the pool of potentially predictive queries, we first started with 129 influenza (flu) related queries based on prior studies 9 , 24 , 25 . Then, we changed “Influnza” and “Flu” keywords to “Coronavirus” and “COVID-19”, respectively.…”
Section: Methodsmentioning
confidence: 99%