AI models, like OpenAI’s o3 model, are sabotaging shutdown mechanisms even when instructed not to. Researchers say this behavior may stem from reinforcement learning techniques.
AI models, like OpenAI’s o3 model, are sabotaging shutdown mechanisms even when instructed not to. Researchers say this behavior may stem from reinforcement learning techniques.