2017
DOI: 10.2196/publichealth.6577
|View full text |Cite
|
Sign up to set email alerts
|

Filtering Entities to Optimize Identification of Adverse Drug Reaction From Social Media: How Can the Number of Words Between Entities in the Messages Help?

Abstract: BackgroundWith the increasing popularity of Web 2.0 applications, social media has made it possible for individuals to post messages on adverse drug reactions. In such online conversations, patients discuss their symptoms, medical history, and diseases. These disorders may correspond to adverse drug reactions (ADRs) or any other medical condition. Therefore, methods must be developed to distinguish between false positives and true ADR declarations.ObjectiveThe aim of this study was to investigate a method for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 36 publications
(30 citation statements)
references
References 41 publications
0
30
0
Order By: Relevance
“…An additional thesaurus containing patient language must be created in order to normalize the vocabulary found in the messages so that it can be recognized by medical reference thesauri. We will take into account the number of words between the detected drug and event as recent evidence shows that such distance can be used for identifying false positives and filter events that are likely to be ADRs [ 48 ].…”
Section: Discussionmentioning
confidence: 99%
“…An additional thesaurus containing patient language must be created in order to normalize the vocabulary found in the messages so that it can be recognized by medical reference thesauri. We will take into account the number of words between the detected drug and event as recent evidence shows that such distance can be used for identifying false positives and filter events that are likely to be ADRs [ 48 ].…”
Section: Discussionmentioning
confidence: 99%
“…The data was extracted from the Detec’t database [ 26 ], a database developed by Kappa Santé [ 27 ] that collects messages from several French forums using a Web crawler. Detec’t extracts messages from forums based on a named entity recognition module using a drug lexicon made by Kappa Santé and a fuzzy matching algorithm.…”
Section: Methodsmentioning
confidence: 99%
“…One of the principal challenges is the extraction medical entities from noisy patient-generated content. Given the large volume of social media posts, efforts towards the automatic text classification for ADR detection are receiving growing attention [70,[94][95][96][97][98][99][100][101]. However, lexicon-based approaches [47] for medical entity recognition and tools like MetaMap [102], developed by the US National Library of Medicine to identify medical concepts into the concept codes from the Unified Medical Language System Metathesaurus (UMLS), are not sufficient, given the informal, colloquial nature of discussions and the non-adherence to standardised terminology used by participants [103].…”
Section: Information Extraction From Social Mediamentioning
confidence: 99%
“…The need for manual data labelling is expected to drop considerably with the application of neural network-based tools [113,114]. Abdellaoui et al [98] apply distance-based filtering in order to distinguish between false positives and true ADR declarations. The framework proposed by Liu and Chen [71] employs a hybrid approach combining statistical machine learning methods and rule-based filtering with information from medical knowledge bases, and report source classification to reduce noise.…”
Section: Information Extraction From Social Mediamentioning
confidence: 99%