2020
DOI: 10.1007/s11023-020-09539-2
|View full text |Cite
|
Sign up to set email alerts
|

Artificial Intelligence, Values, and Alignment

Abstract: This paper looks at philosophical questions that arise in the context of AI alignment. It defends three propositions. First, normative and technical aspects of the AI alignment problem are interrelated, creating space for productive engagement between people working in both domains. Second, it is important to be clear about the goal of alignment. There are significant differences between AI that aligns with instructions, intentions, revealed preferences, ideal preferences, interests and values. A principle-bas… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
179
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 312 publications
(182 citation statements)
references
References 52 publications
1
179
0
2
Order By: Relevance
“…Firstly, the introduced approaches can be again summarized by recapping two further wordings of Gabriel [17]:…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Firstly, the introduced approaches can be again summarized by recapping two further wordings of Gabriel [17]:…”
Section: Discussionmentioning
confidence: 99%
“…He then elaborates on challenges for certain groups, such as antisocial psychopaths, but also children as well as nonhuman animals. Additionally, Gabriel mentions in his elaboration on AI alignment nonhuman animals and sentient beings in general initially, but not anymore when he explores the details of value extraction and aggregation [17]. Sarma and colleagues argue for an interdisciplinary approach for the AI value alignment problem, involving neuroscience in particular, which may lead to insights of what they call "mammalian value systems" [18,19].…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations