2023
DOI: 10.1177/20539517221145372
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning and the politics of synthetic data

Abstract: Machine-learning algorithms have become deeply embedded in contemporary society. As such, ample attention has been paid to the contents, biases, and underlying assumptions of the training datasets that many algorithmic models are trained on. Yet, what happens when algorithms are trained on data that are not real, but instead data that are ‘synthetic’, not referring to real persons, objects, or events? Increasingly, synthetic data are being incorporated into the training of machine-learning algorithms for use i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 29 publications
(13 citation statements)
references
References 38 publications
0
13
0
Order By: Relevance
“…In part, the necessity of consistent scrutiny reiterates the need for established methods to assess SD’s actionability 2,23,24 . Regardless, SD can technically be made actionable, and exuberant innovation can not only simplify this process and appraisal thereof but can incorporate here unmentioned domains such as SD fairness 38 , yet for true SD actionability in healthcare established ethical and legal consensus is paramount as these should dictate what actionability actually implies 5,8 .…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In part, the necessity of consistent scrutiny reiterates the need for established methods to assess SD’s actionability 2,23,24 . Regardless, SD can technically be made actionable, and exuberant innovation can not only simplify this process and appraisal thereof but can incorporate here unmentioned domains such as SD fairness 38 , yet for true SD actionability in healthcare established ethical and legal consensus is paramount as these should dictate what actionability actually implies 5,8 .…”
Section: Discussionmentioning
confidence: 99%
“…In this case, the goal of SD is to catalyse knowledge generation in healthcare, but without exposing real human subjects’ data. While use of SD within healthcare is still in development, ethicists, policy makers, and researchers are trying to establish how SD can be used appropriately 5,8 . Meanwhile, various techniques 9–12 are now openly available which researchers can use to train generative models and thus create SD for various purposes 13–22 .…”
Section: Introductionmentioning
confidence: 99%
“…However, as machine learning algorithms gained traction, they required and, in turn, propelled a turn towards datafication, data extraction and accumulation and, increasingly data production ostensibly ex nihilo through generative models for synthetic data (Jacobsen, 2023;Steinhoff, 2022). Propelled by the uptake of social media and portable devices and by broader turns across industries towards data-driven business models that made data monetization an essential component of the revenue and profitability of small and large businesses alike, datafication, 'surveillance capitalism' (Zuboff, 2019) and 'surveillance advertising' (Crain, 2021) turned towards the production of data as a commodity and as capital (Sadowski, 2019), that is, both as an asset that can be bought and sold, and as a mix of raw material and productive factor for the 'platform political economy' (Langley & Leyshon, 2021).…”
Section: Thinking Through Hardware: Gpus and The Making Of Artificial...mentioning
confidence: 99%
“…Indeed, as Amoore and colleagues have shown, while connectionism was present since the 1950s, with Rosenblatt’s model of the perceptron (Rosenblatt, 1958), and while it started to gain momentum in the 1980s, it was only in the early 2000s and some of the ‘victories’ such as the ImageNet competition, thanks to GPUs, that this epistemological approach became mainstream (Amoore et al, 2023, p. 1). Convolutional neural networks for image recognition had been performing well, but they were rejected by the scientific community because, according to Hinton, the task of AI had been to define deductively the task that the machine was meant to solve, rather than programming a machine that inductively performed well at a task (Hinton, 2019, p. 24).…”
Section: Thinking Through Hardware: Gpus and The Making Of Artificial...mentioning
confidence: 99%
See 1 more Smart Citation