2024
DOI: 10.1002/aaai.12167

Engineering AI for provable retention of objectives over time

Adeniyi Fasoro

Abstract: I argue that ensuring artificial intelligence (AI) retains alignment with human values over time is critical yet understudied. Most research focuses on static alignment, neglecting crucial retention dynamics enabling stability during learning and autonomy. This paper elucidates limitations constraining provable retention, arguing key gaps include formalizing dynamics, transparency of advanced systems, participatory scaling, and risks of uncontrolled recursive self‐improvement. I synthesize technical and ethica…

Cited by 0 publications
References 34 publications (85 reference statements)