Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation

Williams, Matthew Leighton; Burnap, Pete; Sloan, Luke

doi:10.1177/0038038517708140

Cited by 299 publications

(237 citation statements)

References 41 publications

Supporting

Mentioning

228

Contrasting

Unclassified

Order By: Relevance

“…Thus, the development of automated tools might contribute to ethical practices and research implications, though outside the standard framework of research ethics. Algorithms should pursue the maximum benefit minimising the risk of potential harm during data collection, analysis and publication, while researchers should assess algorithms' performance and routinely test them for effectiveness, avoiding the mislabelling of content [61]. Furthermore, discarding re-tweets may be considered a discretionary choice, since we aimed at preliminarily investigating individual-level data on social networking about alcohol-related behaviours.…”

Section: Discussionmentioning

confidence: 99%

Detecting Binge Drinking and Alcohol-Related Risky Behaviours from Twitter’s Users: An Exploratory Content- and Topology-Based Analysis

Crocamo

Viviani

Bartoli

et al. 2020

IJERPH

View full text Add to dashboard Cite

Binge Drinking (BD) is a common risky behaviour that people hardly report to healthcare professionals, although it is not uncommon to find, instead, personal communications related to alcohol-related behaviors on social media. By following a data-driven approach focusing on User-Generated Content, we aimed to detect potential binge drinkers through the investigation of their language and shared topics. First, we gathered Twitter threads quoting BD and alcohol-related behaviours, by considering unequivocal keywords, identified by experts, from previous evidence on BD. Subsequently, a random sample of the gathered tweets was manually labelled, and two supervised learning classifiers were trained on both linguistic and metadata features, to classify tweets of genuine unique users with respect to media, bot, and commercial accounts. Based on this classification, we observed that approximately 55% of the 1 million alcohol-related collected tweets was automatically identified as belonging to non-genuine users. A third classifier was then trained on a subset of manually labelled tweets among those previously identified as belonging to genuine accounts, to automatically identify potential binge drinkers based only on linguistic features. On average, users classified as binge drinkers were quite similar to the standard genuine Twitter users in our sample. Nonetheless, the analysis of social media contents of genuine users reporting risky behaviours remains a promising source for informed preventive programs.

show abstract

Section: Discussionmentioning

confidence: 99%

Detecting Binge Drinking and Alcohol-Related Risky Behaviours from Twitter’s Users: An Exploratory Content- and Topology-Based Analysis

Crocamo

Viviani

Bartoli

et al. 2020

IJERPH

View full text Add to dashboard Cite

show abstract

“…7 We suppose that users perceived the page as public space where they intentionally wanted to express their own opinion on vaccination. On the other hand, we also suppose that many users are not completely aware of the privacy setting and its consequences (Williams et al 2017) and we thus considered the risk of harm and the sensitivity of the data (see Townsend and Wallace 2016). In light of this consideration, several potentially illustrative quotations were dropped or trimmed down.…”

Section: Methodsmentioning

confidence: 99%

The vaccination debate in the “post‐truth” era: social media as sites of multi‐layered reflexivity

Numerato

Vochocová

Štětka

et al. 2019

Sociology Health & Illness

View full text Add to dashboard Cite

This paper analyses the contemporary public debate about vaccination, and medical knowledge more broadly, in the context of social media. The study is focused on the massive online debate prompted by the Facebook status of the digital celebrity Mark Zuckerberg, who posted a picture of his two‐month‐old daughter, accompanied by a comment: ’Doctor's visit – time for vaccines!’ Carrying out a qualitative analysis on a sample of 650 comments and replies, selected through systematic random sampling from an initial pool of over 10,000 user contributions, and utilising open and axial coding, we empirically inform the theoretical discussion around the concept of the reflexive patient and introduce the notion of multi‐layered reflexivity. We argue that the reflexive debate surrounding this primarily medical problem is influenced by both biomedical and social scientific knowledge. Lay actors therefore discuss not only vaccination, but also its political and economic aspects as well as the post‐truth information context of the debate. We stress that the reflexivity of social actors related to the post‐truth era re‐enters and influences the debate more than ever. Furthermore, we suggest that the interconnection of different layers of reflexivity can either reinforce certainty or deepen the ambiguity and uncertainty of reflexive agents.

show abstract

“…However, in practice it is often infeasible to seek retrospective consent from hundreds or thousands of social media users. According to current ethical guidelines for social media research (Benton et al, 2017a;Williams et al, 2017) and practice in comparable research projects (O'Dea et al, 2015;Ahmed et al, 2017), it is regarded as acceptable to waive explicit consent if the anonymity of the users is preserved. Therefore, we will not ask the account holders of Twitter and Reddit posts included in our datasets for their consent.…”

Section: Ethical Considerationsmentioning

confidence: 99%

A Computational Linguistic Study of Personal Recovery in Bipolar Disorder

Jagfeld

2019

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop

View full text Add to dashboard Cite

Mental health research can benefit increasingly fruitfully from computational linguistics methods, given the abundant availability of language data in the internet and advances of computational tools. This interdisciplinary project will collect and analyse social media data of individuals diagnosed with bipolar disorder with regard to their recovery experiences. Personal recovery -living a satisfying and contributing life along symptoms of severe mental health issues -so far has only been investigated qualitatively with structured interviews and quantitatively with standardised questionnaires with mainly English-speaking participants in Western countries. Complementary to this evidence, computational linguistic methods allow us to analyse firstperson accounts shared online in large quantities, representing unstructured settings and a more heterogeneous, multilingual population, to draw a more complete picture of the aspects and mechanisms of personal recovery in bipolar disorder.

show abstract

Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation

Cited by 299 publications

References 41 publications

Detecting Binge Drinking and Alcohol-Related Risky Behaviours from Twitter’s Users: An Exploratory Content- and Topology-Based Analysis

Detecting Binge Drinking and Alcohol-Related Risky Behaviours from Twitter’s Users: An Exploratory Content- and Topology-Based Analysis

The vaccination debate in the “post‐truth” era: social media as sites of multi‐layered reflexivity

A Computational Linguistic Study of Personal Recovery in Bipolar Disorder

Contact Info

Product

Resources

About