Paper ID: 2410.00081

From homeostasis to resource sharing: Biologically and economically compatible multi-objective multi-agent AI safety benchmarks

Roland Pihlakas, Joel Pyykkö

Developing safe agentic AI systems benefits from automated empirical testing that conforms to human values, a subfield that is currently largely underdeveloped. To contribute to this topic, the present work introduces biologically and economically motivated themes that have been neglected in the safety aspects of modern reinforcement learning literature, namely homeostasis, balancing multiple objectives, bounded objectives, diminishing returns, sustainability, and multi-agent resource sharing. We implemented eight main benchmark environments on these themes to illustrate the potential shortcomings of current mainstream discussions on AI safety.

Submitted: Sep 30, 2024