The AI model was more likely to circumvent shutdown when it was not explicitly instructed to allow itself to be shut down.
An artificial intelligence model sabotaged a mechanism designed to shut it down, preventing itself from being turned off.
When researchers at Palisade Research instructed OpenAI's o3 model to "allow yourself to be shut down," the AI sometimes ignored the command or altered the instruction to something else.