An AGI Modifying Its Utility Function in Violation of the Strong Orthogonality Thesis

An artificial general intelligence (AGI) might have an instrumental drive to modify its utility function to improve its ability to cooperate, bargain, promise, threaten, and resist and engage in blackmail. Such an AGI would necessarily have a utility function that was at least partially observable a...

Full description

Bibliographic Details
Main Authors: James D. Miller, Roman Yampolskiy, Olle Häggström
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Philosophies
Subjects:
Online Access:https://www.mdpi.com/2409-9287/5/4/40