OBJECTIVES: Online patient discussions in health-related social media forums represent a rich and increasingly important source for helping to determine the patient perspective concerning all aspects of their medical condition. Yet there is a common perception in the pharmaceutical industry that use of such data will lead to an overwhelming number of reportable adverse events (AEs). This perception can potentially hold back the use of health-related social media, and thereby hold back an understanding of the patient perspective. In this study, we set out to determine the frequency of reportable AEs in a large sample of patient posts. METHODS: We used a combination of regular expressions and machine learning techniques on a collection of 10,000 posts obtained from cancer discussion forums, to detect posts that met all four of the required criteria for reporting of AEs. We first quantified the performance of the machine learning algorithm with a set of simulated testing posts that were randomly generated by inserting combinations of randomly generated postal addresses, email addresses, zip codes and telephone numbers together with AEs and treatment names. Potentially reportable posts were then manually reviewed. RESULTS: Under testing conditions with simulated posts, the machine learning algorithm identified reportable posts with an AUC of 0.928 and an overall accuracy of 88%. On the collection of 10,000 real posts from cancer forums the algorithm identified 505 potentially reportable posts for manual review. After manual review only two posts met all four criteria for reporting AEs. CONCLUSIONS: Whilst there is a concern that studies involving use of healthrelated social media discussions will lead to a large number of AEs being detected, this study found that posts meeting all four criteria for reporting are very rare, with only 0.1% of posts meeting the criteria for reporting.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.