Abstract

A simple argument suggests that we can fruitfully model advanced AI systems using expected utility theory. According to this argument, an agent will need to act as if maximising expected utility if they're to avoid exploitation. Insofar as we should expect advanced AI to avoid exploitation, it follows that we should expect advanced AI to act as if maximising expected utility. I spell out this argument more carefully and demonstrate that it fails, but show that the manner of its failure is instructive: in exploring the argument, we gain insight into how to model advanced AI systems.
