Human-aligned artificial intelligence is a multiobjective problem

doi:10.1007/s10676-017-9440-6

Human-aligned artificial intelligence is a multiobjective problem

P. Vamplew, Richard Dazeley, Cameron Foale

Mar 1, 2018

Citations

Influential Citations

Citations

Quality indicators

Journal

Ethics and Information Technology

Full text

Semantic Scholar

Key Takeaway

Key Takeaway: A Multiobjective Maximum Expected Utility paradigm using vector utilities and non-linear action-selection can effectively support multiobjective decision-making in human-aligned AI systems.

Abstract

As the capabilities of artificial intelligence (AI) systems improve, it becomes important to constrain their actions to ensure their behaviour remains beneficial to humanity. A variety of ethical, legal and safety-based frameworks have been proposed as a basis for designing these constraints. Despite their variations, these frameworks share the common characteristic that decision-making must consider multiple potentially conflicting factors. We demonstrate that these alignment frameworks can be represented as utility functions, but that the widely used Maximum Expected Utility (MEU) paradigm provides insufficient support for such multiobjective decision-making. We show that a Multiobjective Maximum Expected Utility paradigm based on the combination of vector utilities and non-linear action–selection can overcome many of the issues which limit MEU’s effectiveness in implementing aligned AI. We examine existing approaches to multiobjective AI, and identify how these can contribute to the development of human-aligned intelligent agents.

copied to clipboard