2018
DOI: 10.1145/3274405
|View full text |Cite
|
Sign up to set email alerts
|

Trust in Data Science

Abstract: The trustworthiness of data science systems in applied and real-world settings emerges from the resolution of specific tensions through situated, pragmatic, and ongoing forms of work. Drawing on research in CSCW, critical data studies, and history and sociology of science, and six months of immersive ethnographic fieldwork with a corporate data science team, we describe four common tensions in applied data science work: (un)equivocal numbers, (counter)intuitive knowledge, (in)credible data, and (in)scrutable m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
33
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 134 publications
(44 citation statements)
references
References 80 publications
1
33
0
Order By: Relevance
“…For example, IoT sensors calibrated to measure the salinity of water may, over time, begin to provide incorrect values due to biofouling. Data science information products often rely on near real-time data to provide timely alerts, and, as such, problems may arise if these data quality issues are not timely detected and corrected (Gao et al 2015;Passi and Jackson 2018).…”
Section: The Role Of Data Governance With Regards To Data Science As mentioning
confidence: 99%
See 3 more Smart Citations
“…For example, IoT sensors calibrated to measure the salinity of water may, over time, begin to provide incorrect values due to biofouling. Data science information products often rely on near real-time data to provide timely alerts, and, as such, problems may arise if these data quality issues are not timely detected and corrected (Gao et al 2015;Passi and Jackson 2018).…”
Section: The Role Of Data Governance With Regards To Data Science As mentioning
confidence: 99%
“…Data scientists gain domain expertise and apply this knowledge in big data analysis to gain the best results (Gao et al 2015). However, the trustworthiness of data science outcomes in practice is often affected by tensions arising through ongoing forms of work (Passi and Jackson 2018). According to Passi and Jackson (2018), data science is a socio-material practice in which human agency and technology are mutually intertwined.…”
Section: The Role Of Data Governance With Regards To Data Science As mentioning
confidence: 99%
See 2 more Smart Citations
“…Below we describe the importance of cross-disciplinary infrastructure; describe five ways shared vocabularies support productive crossdisciplinary work; offer a definitional overview of concepts of fairness, examining how the term is defined and conceptualized in different disciplines; provide a Fairness Analytic to provide a set of dimensions to discuss concepts fairness across definitions and disciplines; apply these tools to a case study; and reflect on this work as an incremental step toward building a shared vocabulary that fosters the collaborative work of this emerging community. While the community focused on empirical analysis to design, including: critical analysis of harms such systems can cause (particularly when purportedly recognizing and categorizing gender) and the violence they can perpetuate [18,65,75]; ethnographic investigation of how data science and algorithmic work is taught, learned, and enacted in practice [116,117]; user perceptions of "fairness" [94]; and methods to design algorithms while being cognizant of stakeholder values [154].…”
Section: Introductionmentioning
confidence: 99%