2019
DOI: 10.1126/science.aag3311

Preventing undesirable behavior of intelligent machines

Abstract: Intelligent machines using machine learning algorithms are ubiquitous, ranging from simple data analysis and pattern recognition tools to complex systems that achieve superhuman performance on various tasks. Ensuring that they do not exhibit undesirable behavior—that they do not, for example, cause harm to humans—is therefore a pressing problem. We propose a general and flexible framework for designing machine learning algorithms. This framework simplifies the problem of specifying and regulating undesirable b…

Cited by 118 publications (82 citation statements)
References 145 publications
“…In a recent article published by Thomas PS et al, the application of a defined Seldon algorithm is described. The authors propose a framework for designing machine learning algorithms and show how it can be used to construct algorithms that provide their users with the ability to easily place limits on the probability that the algorithm will produce any specified undesirable behavior [23].…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…Techniques have been developed to reduce social bias in some neural networks designs such as adjusting the high dimensional vectors representing individual words to remove differences in the distance from the concepts of male and female in word2vec [11] and the Seldonian approach of describing and regulating undesirable behavior [12].…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…This group recommended that the development, deployment, and use of AI systems should adhere to the ethical principles of respect for human autonomy, prevention of harm, fairness/equity and explicability [27]. It is clear that an intrinsically ethical AI could be a solution to avoid some drawbacks, i.e., the case of the Seldon algorithm which places limits to the probability that an algorithm will produce any specified undesirable behaviour [28].…”
Section: Solutions: the Algor-ethics
Citation type: mentioning
confidence: 99%