2023
DOI: 10.34133/hds.0078
|View full text |Cite
|
Sign up to set email alerts
|

#ChronicPain: Automated Building of a Chronic Pain Cohort from Twitter Using Machine Learning

Abstract: Background: Due to the high burden of chronic pain, and the detrimental public health consequences of its treatment with opioids, there is a high-priority need to identify effective alternative therapies. Social media is a potentially valuable resource for knowledge about self-reported therapies by chronic pain sufferers. Methods: We attempted to (a) verify the presence of large-scale chronic pain-related chatter on Twitter, (b) develop natural language processing and machin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 45 publications
0
2
0
Order By: Relevance
“…[3] For Task 2, the dataset included 5364 English-language tweets that mentioned a total of 32 therapies, such as medication, behavioral, and physical. [4] These tweets were posted by a cohort of users who self-reported chronic pain on Twitter, [11] so it is likely that the sentiments associated with the therapies are being expressed by patients who are actually experiencing them. For Task 3, the dataset included 10,150 tweets that were written in Latin American Spanish and reported COVID-19 symptoms of the user or someone known to the user, building on a previous iteration of this task involving the multi-class classification of Spanish-language tweets that mentioned COVID-19 symptoms.…”
Section: Data Collectionmentioning
confidence: 99%
“…[3] For Task 2, the dataset included 5364 English-language tweets that mentioned a total of 32 therapies, such as medication, behavioral, and physical. [4] These tweets were posted by a cohort of users who self-reported chronic pain on Twitter, [11] so it is likely that the sentiments associated with the therapies are being expressed by patients who are actually experiencing them. For Task 3, the dataset included 10,150 tweets that were written in Latin American Spanish and reported COVID-19 symptoms of the user or someone known to the user, building on a previous iteration of this task involving the multi-class classification of Spanish-language tweets that mentioned COVID-19 symptoms.…”
Section: Data Collectionmentioning
confidence: 99%
“…Every new trend has its challenges, and since natural language processing tasks are fundamental in nature, this fact can be used to carefully suggest the direction of future research. There are a few areas of inquiry that will need further study in the future: (1) It is difficult to mine the medical text data stored in EHR or social media platforms because of imbalanced data, misspellings or ambiguity, high lexical variability, the difficulty of de-identification, the lack of key annotation, and the difficulty of obtaining all the semantic and syntactic attributes from complex sentences due to the lack of efficient methods that can capture all of them (Li et al, 2019;Pandey, Pandey, Mishra, & Rhmann, 2021;Sarker, Gonzalez-Hernandez, & Perrone, 2019;X. Wang, Hripcsak, Markatou, & Friedman, 2009).…”
Section: Discovering Research Trends and Future Directionmentioning
confidence: 99%