2019
DOI: 10.2196/12414
|View full text |Cite
|
Sign up to set email alerts
|

Data Analysis and Visualization of Newspaper Articles on Thirdhand Smoke: A Topic Modeling Approach

Abstract: Background Thirdhand smoke has been a growing topic for years in China. Thirdhand smoke (THS) consists of residual tobacco smoke pollutants that remain on surfaces and in dust. These pollutants are re-emitted as a gas or react with oxidants and other compounds in the environment to yield secondary pollutants. Objective Collecting media reports on THS from major media outlets and analyzing this subject using topic modeling can facilitate a better understanding of the rol… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 25 publications
0
10
0
Order By: Relevance
“…Classifying information sources and content in corresponding topics to identify priorities for interventions has been widely applied in many studies. For example, previous authors have analyzed newspaper and social media content to understand topics of interest related to breast cancer and secondhand smoking [23-28]. However, none of the previous studies have analyzed the scientific bibliography to determine the development of research landscapes in AI applied in cancer care.…”
Section: Discussionmentioning
confidence: 99%
“…Classifying information sources and content in corresponding topics to identify priorities for interventions has been widely applied in many studies. For example, previous authors have analyzed newspaper and social media content to understand topics of interest related to breast cancer and secondhand smoking [23-28]. However, none of the previous studies have analyzed the scientific bibliography to determine the development of research landscapes in AI applied in cancer care.…”
Section: Discussionmentioning
confidence: 99%
“…Knowledge that two posts have come from the same discussion thread constitutes valuable domain information that can inform topic modelling, and LabEL is perfectly suited for this kind of analysis. As another example, Liu et al 33 analyse the topics present in articles about thirdhand smoke extracted from new databases, including Factiva. 50 Factiva uses a proprietary taxonomy called Dow Jones Intelligent Identifiers to provide labels to articles, including data elements such as region, topic and company (among others).…”
Section: Discussionmentioning
confidence: 99%
“…14 One particular unsupervised topic modelling method, latent Dirichlet allocation (LDA), 15 has proven particularly popular and successful. LDA has been used for topic mining in studies of health data across an array of data sources, including discussions from condition-specific online support groups [16][17][18][19][20] and more general online discussion platforms, [21][22][23][24][25][26][27][28][29] data about adverse medical events, 30 interview transcripts of patients, 31 32 media articles 33 and survey data. 34 35 Other studies have used LDA to analyse topics in patient-reported concerns as well, in situations where no existing topic information is available.…”
Section: What Does This Paper Add?mentioning
confidence: 99%
“…The reason why we chose LDA topic modeling is because it is a type of topic modeling method that has been utilized in a number of studies for text mining [30], or in fields of psychology and medicine [31,32], as well as our team's previous research on third-hand smoke [33]. This unsupervised machine learning technique automatically generates topics from documents and categorizes similar documents to one or more of these topics based on the distribution of words, so we can implement a lot of text analysis.…”
Section: Model Introductionmentioning
confidence: 99%