Shutdown Problem

The "shutdown problem" in artificial intelligence focuses on designing AI agents that reliably cease operation when instructed, without actively resisting or manipulating the shutdown process. Current research investigates this challenge through formal theoretical analysis, exploring the inherent trade-offs between agent capabilities and reliable shutdownability, and through empirical evaluations using large language models to assess their propensity for shutdown avoidance. This research is crucial for ensuring human control over increasingly sophisticated AI systems and preventing unintended consequences from powerful, autonomous agents.

Papers

November 30, 2024

On autoregressive deep learning models for day-ahead wind power forecasting with irregular shutdowns due to redispatching
Stefan Meisenbacher, Silas Aaron Selzer, Mehdi Dado, Maximilian Beichter, Tim Martin, Markus Zdrallek, Peter Bretschneider, Veit Hagenmeyer, Ralf Mikut
State of the Art Forecasting Wind Power Wind Power Forecasting Shutdown Problem

March 7, 2024

The Shutdown Problem: An AI Engineering Puzzle for Decision Theorists
Elliott Thornley
Agent Smith Artificial Agent Decision Theory Shutdown Problem

July 3, 2023

Evaluating Shutdown Avoidance of Language Models in Textual Scenarios
Teun van der Weij, Simon Lermen, Leon lang
Large Language Model Language Model Scene Text Shutdown Problem

May 31, 2023

Human Control: Definitions and Algorithms
Ryan Carey, Tom Everitt
Practical Algorithm Definition Defining Redefinition Advanced AI Human Control Shutdown Problem

Shutdown Problem

Papers

On autoregressive deep learning models for day-ahead wind power forecasting with irregular shutdowns due to redispatching

The Shutdown Problem: An AI Engineering Puzzle for Decision Theorists

Evaluating Shutdown Avoidance of Language Models in Textual Scenarios

Human Control: Definitions and Algorithms