Search citation statements
Paper Sections
Citation Types
Year Published
Publication Types
Relationship
Authors
Journals
Background Stigma surrounding substance use can result in severe consequences for physical and mental health. Identifying situations in which stigma occurs and characterizing its impact could be a critical step toward improving outcomes for individuals experiencing stigma. As part of a larger research project with the goal of informing the development of interventions for substance use disorder, this study leverages natural language processing methods and a theory-informed approach to identify and characterize manifestations of substance use stigma in social media data. Methods We harvested social media data, creating an annotated corpus of 2,214 Reddit posts from subreddits relating to substance use. We trained a set of binary classifiers; each classifier detected one of three stigma types: Internalized Stigma, Anticipated Stigma, and Enacted Stigma, from the Stigma Framework. We evaluated hybrid models that combine contextual embeddings with features derived from extant lexicons and handcrafted lexicons based on stigma theory, and assessed the performance of these models. Then, using the trained and evaluated classifiers, we performed a mixed-methods analysis to quantify the presence and type of stigma in a corpus of 161,448 unprocessed posts derived from subreddits relating to substance use. Results For all stigma types, we identified hybrid models (RoBERTa combined with handcrafted stigma features) that significantly outperformed RoBERTa-only baselines. In the model’s predictions on our unseen data, we observed that Internalized Stigma was the most prevalent stigma type for alcohol and cannabis, but in the case of opioids, Anticipated Stigma was the most frequent. Feature analysis indicated that language conveying Internalized Stigma was predominantly characterized by emotional content, with a focus on shame, self-blame, and despair. In contrast, Enacted Stigma and Anticipated involved a complex interplay of emotional, social, and behavioral features. Conclusion Our main contributions are demonstrating a theory-based approach to extracting and comparing different types of stigma in a social media dataset, and employing patterns in word usage to explore and characterize its manifestations. The insights from this study highlight the need to consider the impacts of stigma differently by mechanism (internalized, anticipated, and enacted), and enhance our current understandings of how each stigma mechanism manifests within language in particular cognitive, emotional, social, and behavioral aspects.
Background Stigma surrounding substance use can result in severe consequences for physical and mental health. Identifying situations in which stigma occurs and characterizing its impact could be a critical step toward improving outcomes for individuals experiencing stigma. As part of a larger research project with the goal of informing the development of interventions for substance use disorder, this study leverages natural language processing methods and a theory-informed approach to identify and characterize manifestations of substance use stigma in social media data. Methods We harvested social media data, creating an annotated corpus of 2,214 Reddit posts from subreddits relating to substance use. We trained a set of binary classifiers; each classifier detected one of three stigma types: Internalized Stigma, Anticipated Stigma, and Enacted Stigma, from the Stigma Framework. We evaluated hybrid models that combine contextual embeddings with features derived from extant lexicons and handcrafted lexicons based on stigma theory, and assessed the performance of these models. Then, using the trained and evaluated classifiers, we performed a mixed-methods analysis to quantify the presence and type of stigma in a corpus of 161,448 unprocessed posts derived from subreddits relating to substance use. Results For all stigma types, we identified hybrid models (RoBERTa combined with handcrafted stigma features) that significantly outperformed RoBERTa-only baselines. In the model’s predictions on our unseen data, we observed that Internalized Stigma was the most prevalent stigma type for alcohol and cannabis, but in the case of opioids, Anticipated Stigma was the most frequent. Feature analysis indicated that language conveying Internalized Stigma was predominantly characterized by emotional content, with a focus on shame, self-blame, and despair. In contrast, Enacted Stigma and Anticipated involved a complex interplay of emotional, social, and behavioral features. Conclusion Our main contributions are demonstrating a theory-based approach to extracting and comparing different types of stigma in a social media dataset, and employing patterns in word usage to explore and characterize its manifestations. The insights from this study highlight the need to consider the impacts of stigma differently by mechanism (internalized, anticipated, and enacted), and enhance our current understandings of how each stigma mechanism manifests within language in particular cognitive, emotional, social, and behavioral aspects.
With the research sex gap impacting available data on women’s health and the growing popularity of social media, it is not rare that individuals will seek health-related information on such platforms. Understanding how women use social media for perinatal-specific issues is crucial to gain knowledge on specific needs and gaps. The Tumblr platform is an excellent candidate to further understand the representation and discourse regarding perinatal health on social media. The objective was to identify specific themes to assess the present discourse pertaining to perinatal health. Posts were collected using Tumblr’s official API client over a 4-day period, from August 18 to 21, 2023, inclusively. A sentiment analysis was performed using the Valence Aware Dictionary and sEntiment Reasoner sentiment analysis toolkit and a deductive thematic analysis. In total, 235 posts were analyzed, and 11 individual categories were identified and divided into two main concepts; Women’s Health (Endometriosis; Postpartum Depression, Menopause, Miscarriage, Other Health Problems, Political Discourse) and Pregnancy/Childbirth (Maternal Mortality, Personal Stories, Pregnancy Symptoms, and Fitness/diet/weight). The last category was classified as Misinformation/Advertisement. Findings revealed that users used the Tumblr platform to share personal experiences regarding pregnancy, seek support from others, raise awareness, and educate on women’s health topics. Misinformation represented only 3% of the total sample. The present study demonstrates the feasibility of using in-depth data from Tumblr posts to inform us regarding current issues and topics specific to perinatal and women’s health. More research studies are needed to better understand the impact of social support and misinformation on perinatal health.
Background: Obesity is a chronic, multifactorial and relapsing disease, affecting people of all ages worldwide and is directly related to multiple complications. Understanding public attitudes and perceptions towards obesity is essential for developing effective health policies, prevention strategies, and treatment approaches.Objective: This study investigates the sentiments of the general public, celebrities, and important organizations regarding obesity using social media data, specifically from Twitter. Methods:The study analyzes a dataset of 53,414 tweets related to obesity posted on Twitter during the COVID-19 pandemic, from April 2019 to December 2022. Sentiment analysis was performed using the XLM-Roberta-base model, and topic modeling was conducted using the BERTopic library. Results:The analysis revealed that tweets regarding obesity were predominantly negative. Spikes in Twitter activity correlated with significant political events, such as negative comments on President Trump's obesity struggle and Boris Johnson's criticized obesity campaign. Ben Shapiro's remarks on not vaccinating people with obesity for COVID-19 also sparked outrage. Topic modeling identified 243 clusters representing various obesity-related topics, such as childhood obesity, President Trump's obesity struggle, COVID-19 vaccinations, Boris Johnson's obesity campaign, body shaming, racism and high obesity rates among Black Americans, smoking, substance abuse, and alcohol consumption among people with obesity, environmental risk factors, and surgical treatments. Conclusions:Twitter serves as a valuable source for understanding obesity-related sentiments and attitudes among the public, celebrities, and influential organizations. Sentiments regarding obesity were predominantly negative. Negative portrayals of obesity by influential politicians and celebrities were shown to contribute to negative public sentiments, which can have adverse effects on public health. It is essential for public figures to be mindful of their impact on public opinion and the potential consequences of their statements.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.