Abstract
Agents that focus solely on achieving their own goals may cause significant harm to society. When deciding which actions to perform, agents must therefore consider societal values and how their actions affect those values: the 'value alignment problem'. Addressing this problem requires integrating quantitative machine reasoning with the ability to reason about the qualitative aspects of human values. In this paper, we present a novel framework for value-based reasoning that aims to bridge the gap between these two modes of reasoning. In particular, our framework extends the theory of grading to model how societal values can trade off against each other or against the agent's goals. Furthermore, it introduces hyperreal numbers to represent both the quantitative and qualitative aspects of reasoning, helping to address the value alignment problem.
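To make the role of hyperreals concrete, the sketch below shows one common encoding of such numbers: tuples of coefficients for an infinite unit ω, a standard real part, and an infinitesimal unit ε, compared lexicographically so that any ω-scale difference dominates every finite difference. This is an illustrative assumption rather than the paper's own formalization; the class and field names (`Hyperreal`, `infinite`, `finite`, `infinitesimal`) are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True, order=True)
class Hyperreal:
    """A number of the form infinite*omega + finite + infinitesimal*epsilon.

    With fields declared from most to least significant, the ordering
    generated by `order=True` compares instances as tuples of fields,
    i.e. lexicographically: any difference in the omega coefficient
    outweighs every finite difference, which in turn outweighs epsilon.
    """
    infinite: float = 0.0       # coefficient of omega: qualitative value grades
    finite: float = 0.0         # ordinary real part: quantitative goal utility
    infinitesimal: float = 0.0  # coefficient of epsilon: tie-breaking preferences

    def __add__(self, other: "Hyperreal") -> "Hyperreal":
        return Hyperreal(self.infinite + other.infinite,
                         self.finite + other.finite,
                         self.infinitesimal + other.infinitesimal)

# A violated societal value graded on the omega scale dominates any goal
# payoff graded on the finite scale, however large.
harm = Hyperreal(infinite=-1.0)   # e.g. the value "do no harm" is violated
reward = Hyperreal(finite=1e9)    # an arbitrarily large quantitative reward

assert harm + reward < Hyperreal()  # no finite reward offsets the violation
```

Under this encoding, trading values off against goals reduces to ordinary addition and comparison, which is one way a quantitative reasoner could incorporate qualitative constraints without ad hoc thresholds.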
| Original language | English |
| --- | --- |
| Title of host publication | The 19th European Conference on Multi-Agent Systems |
| Publication status | Accepted/In press - 25 Jul 2022 |