Background
Cancer poses a serious threat to the health of Chinese people and presents a major challenge for public health work. Today, people can obtain relevant information not only from medical workers in hospitals but also from the internet, anywhere and in real time. Search behaviors can reflect a population's awareness of cancer from a completely new perspective, and this awareness could be driven by the underlying cancer epidemiology. However, such Web-retrieved data are not yet well validated or understood.
Objective
This study aimed to explore whether a correlation exists between the incidence and mortality of cancers and normalized internet search volumes on the big data platform, Baidu. We also assessed whether the distribution of people who searched for specific types of cancer differed by gender. Finally, we determined whether there were regional disparities among people who searched the Web for cancer-related information.
Methods
Standard Boolean operators were used to select search terms for each type of cancer. Spearman's correlation analysis was used to explore correlations between the monthly search index values for each cancer type and the corresponding monthly incidence and mortality rates. We conducted cointegration analysis between search index data and incidence rates, and between search index data and mortality data, to examine whether a stable equilibrium existed between each pair of series.
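As an illustration of the rank-correlation step described in the Methods, the following is a minimal sketch of Spearman's coefficient computed as the Pearson correlation of average ranks. It is not the authors' code: the function names and the six monthly values are hypothetical and chosen only to show the calculation, and the cointegration analysis is not covered here.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two series of equal length >= 2")
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical monthly values, for illustration only (not study data).
search_index = [120, 135, 150, 149, 170, 180]
incidence_rate = [4.1, 4.3, 4.2, 4.6, 4.9, 5.0]
rho = spearman_rho(search_index, incidence_rate)
print(f"Spearman rho = {rho:.2f}")
```

Working on ranks rather than raw values means the coefficient measures monotonic association and is insensitive to the very different scales of a search index and a rate. A significance test (the P value) would need an additional step that this sketch omits.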
Results
The monthly Baidu index was significantly correlated with cancer incidence rates for 26 of 28 cancers in China (lung cancer: r=.80, P<.001; liver cancer: r=.28, P=.016; stomach cancer: r=.50, P<.001; esophageal cancer: r=.50, P<.001; colorectal cancer: r=.81, P<.001; pancreatic cancer: r=.86, P<.001; breast cancer: r=.56, P<.001; brain and nervous system cancer: r=.63, P<.001; leukemia: r=.75, P<.001; Non-Hodgkin lymphoma: r=.88, P<.001; Hodgkin lymphoma: r=.91, P<.001; cervical cancer: r=.64, P<.001; prostate cancer: r=.67, P<.001; bladder cancer: r=.62, P<.001; gallbladder and biliary tract cancer: r=.88, P<.001; lip and oral cavity cancer: r=.88,
...