2020
DOI: 10.23974/ijol.2020.vol5.1.162
|View full text |Cite
|
Sign up to set email alerts
|

Managing Bias When Library Collections Become Data

Abstract: Developments in AI research have dramatically changed what we can do with data and how we can learn from data. At the same time, implementations of AI amplify the prejudices in data often framed as ‘data bias’ and ‘algorithmic bias.’ Libraries, tasked with deciding what is worth keeping, are inherently discriminatory and yet remain trusted sources of information. As libraries begin to systematically approach their collections as data, will they be able to adopt and adapt the AI-driven tools to traditional prac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 24 publications
0
3
0
2
Order By: Relevance
“…Each decision made during the course of ADS development -from the original business use case to the dataset selection and model training process -further biases the algorithm towards an intended outcome. The goal, therefore, is to understand and manage that bias since complete elimination of bias is not realistic (nor is it desired), and instead to mitigate and minimize harms that may arise from the decisions made by the algorithm or system [5,24]. Through the Dataset Nutrition Label, Data Nutrition Project team is committed to continuing to advance this important area of ADS development through well designed, carefully vetted, and context-aware Labels that help mitigate harm while also improving public understanding of the risks and opportunities of working with ADS, and with particular focus on their underlying data.…”
Section: Discussionmentioning
confidence: 99%
“…Each decision made during the course of ADS development -from the original business use case to the dataset selection and model training process -further biases the algorithm towards an intended outcome. The goal, therefore, is to understand and manage that bias since complete elimination of bias is not realistic (nor is it desired), and instead to mitigate and minimize harms that may arise from the decisions made by the algorithm or system [5,24]. Through the Dataset Nutrition Label, Data Nutrition Project team is committed to continuing to advance this important area of ADS development through well designed, carefully vetted, and context-aware Labels that help mitigate harm while also improving public understanding of the risks and opportunities of working with ADS, and with particular focus on their underlying data.…”
Section: Discussionmentioning
confidence: 99%
“…Hvor der på det enkelte bibliotek tidligere kunne ophobe sig mange materialer, skal systemet sikre at materialernes cirkulation bestemmes af efterspørgslen. Systemet samler ikke selv data ind, men baseret på maskinlaering anvender det data fra Cicero på nye måder (Liljegren, 2022, s.44;Coleman 2020). IMS centraliserer med andre ord yderligere styringen af materialebeholdningen.…”
Section: Digitalisering I Folkebibliotekerne I Danmarkunclassified
“…Technologies like artificial intelligence and machine learning are tools to manage data more efficiently (Boman, 2019) and to provide more generalized decisions and answers (Coleman, 2020). For a long time, algorithms in different search engines have captured the interest of researchers within library and information studies (see for example Bucher, 2018;Haider & Sundin, 2019;Noble, 2018).…”
Section: Library Automationmentioning
confidence: 99%
“…Even though IMMS does not collect any data, it manages data from the library system in a new way through machine learning. The use of machine learning provides a more generalized decision-making (Coleman, 2020). There is a tendency to view this as less biased (Coleman, 2020, p. 11) although it is important to note that the parameters are human made meaning bias will always be present (Haider & Sundin, 2020;Noble, 2018).…”
Section: The Limits Of Circulation Datamentioning
confidence: 99%