Twitter-Based Influenza Detection After Flu Peak via Tweets With Indirect Information: Text Mining Study

Wakamiya, Shoko; Kawai, Yukiko; Aramaki, Eiji

doi:10.2196/publichealth.8627

Cited by 69 publications

(74 citation statements)

References 39 publications


“…The vast majority of the papers reviewed focussed on analysing English language text (68 papers), with two papers focussing on Chinese text [76,77] and one paper focussing on Japanese text [31]. With respect to the geographical location of first authors, most of the articles emerged from North America (55), with Europe 7, and Asia (including Australasia and Turkey) (6) all represented.…”

Section: Methods (mentioning)

confidence: 99%

“…Of the six papers reviewed (see Table 4), four used Twitter data [31][32][33]57], and two used Reddit data [10,14], while Al-Garadi et al, provided a review that concentrated on Twitter and Weibo, the Chinese language microblog service [32]. Two of the papers reviewed described the use of supervised machine learning methods [31,32], three papers used unsupervised machine learning methods [10,14,32], and one used a lexicon-based approach [57]. Machine learning methods were used to perform a variety of tasks, including surveillance [10,14,[31][32][33]57], health communication [32], and sentiment analysis [32].…”

Section: Communicable Diseases and Sexually Transmitted Infections (mentioning)

confidence: 99%

“…Two of the papers reviewed described the use of supervised machine learning methods [31,32], three papers used unsupervised machine learning methods [10,14,32], and one used a lexicon-based approach [57]. Machine learning methods were used to perform a variety of tasks, including surveillance [10,14,[31][32][33]57], health communication [32], and sentiment analysis [32]. Several studies concentrated on influenza surveillance using English [10,33] and Japanese [31] Twitter data.…”

Section: Communicable Diseases and Sexually Transmitted Infections (mentioning)

confidence: 99%

“…Machine learning methods were used to perform a variety of tasks, including surveillance [10,14,[31][32][33]57], health communication [32], and sentiment analysis [32]. Several studies concentrated on influenza surveillance using English [10,33] and Japanese [31] Twitter data.…”

Section: Communicable Diseases and Sexually Transmitted Infections (mentioning)

confidence: 99%


Recent Advances in Using Natural Language Processing to Address Public Health Research Questions Using Social Media and Consumer-Generated Data

Conway

Chapman

2019

Yearb Med Inform


Objective: We present a narrative review of recent work on the utilisation of Natural Language Processing (NLP) for the analysis of social media (including online health communities) specifically for public health applications. Methods: We conducted a literature review of NLP research that utilised social media or online consumer-generated text for public health applications, focussing on the years 2016 to 2018. Papers were identified in several ways, including PubMed searches and the inspection of recent conference proceedings from the Association of Computational Linguistics (ACL), the Conference on Human Factors in Computing Systems (CHI), and the International AAAI (Association for the Advancement of Artificial Intelligence) Conference on Web and Social Media (ICWSM). Popular data sources included Twitter, Reddit, various online health communities, and Facebook. Results: In the recent past, communicable diseases (e.g., influenza, dengue) have been the focus of much social media-based NLP health research. However, mental health and substance use and abuse (including the use of tobacco, alcohol, marijuana, and opioids) have been the subject of an increasing volume of research in the 2016 to 2018 period. Associated with this trend, the use of lexicon-based methods remains popular given the availability of psychologically validated lexical resources suitable for mental health and substance abuse research. Finally, we found that in the period under review “modern” machine learning methods (i.e., deep neural-network-based methods), while increasing in popularity, remain less widely used than “classical” machine learning methods.


“…The experiments' results demonstrated the practicability of the proposed approach, which showed acceptable correlation comparing with medical reports statistics, especially at the outbreak and early spread (early epidemic) stage. The authors extended their work in [67] and implemented a robust influenza prediction model that enabled the use of direct and indirect information using tweets from urban and rural areas in Japan. This work was further extended in [68].…”

Section: Twitter Data Analytics In Healthcare (mentioning)

confidence: 99%

Sehaa: A Big Data Analytics Tool for Healthcare Symptoms and Diseases Detection Using Twitter, Apache Spark, and Machine Learning

et al. 2020


Smartness, which underpins smart cities and societies, is defined by our ability to engage with our environments, analyze them, and make decisions, all in a timely manner. Healthcare is the prime candidate needing the transformative capability of this smartness. Social media could enable a ubiquitous and continuous engagement between healthcare stakeholders, leading to better public health. Current works are limited in their scope, functionality, and scalability. This paper proposes Sehaa, a big data analytics tool for healthcare in the Kingdom of Saudi Arabia (KSA) using Twitter data in Arabic. Sehaa uses Naive Bayes, Logistic Regression, and multiple feature extraction methods to detect various diseases in the KSA. Sehaa found that the top five diseases in Saudi Arabia in terms of the actual afflicted cases are dermal diseases, heart diseases, hypertension, cancer, and diabetes. Riyadh and Jeddah need to do more in creating awareness about the top diseases. Taif is the healthiest city in the KSA in terms of the detected diseases and awareness activities. Sehaa is developed over Apache Spark allowing true scalability. The dataset used comprises 18.9 million tweets collected from November 2018 to September 2019. The results are evaluated using well-known numerical criteria (Accuracy and F1-Score) and are validated against externally available statistics.


Concept Drift Adaptive Physical Event Detection for Social Media Streams

Suprem

Musaev

2019

Services – SERVICES 2019


Event detection has long been the domain of physical sensors operating in a static dataset assumption. The prevalence of social media and web access has led to the emergence of social, or human sensors who report on events globally. This warrants development of event detectors that can take advantage of the truly dense and high spatial and temporal resolution data provided by more than 3 billion social users. The phenomenon of concept drift, which causes terms and signals associated with a topic to change over time, renders static machine learning ineffective. Towards this end, we present an application for physical event detection on social sensors that improves traditional physical event detection with concept drift adaptation. Our approach continuously updates its machine learning classifiers automatically, without the need for human intervention. It integrates data from heterogeneous sources and is designed to handle weak-signal events (landslides, wildfires) with around ten posts per event in addition to large-signal events (hurricanes, earthquakes) with hundreds of thousands of posts per event. We demonstrate a landslide detector on our application that detects almost 350% more landslides compared to static approaches. Our application has high performance: using classifiers trained in 2014, achieving event detection accuracy of 0.988, compared to 0.762 for static approaches.
