What is OpenAI Lockdown Mode for prompt injection?

Question

Hans Steiner · Accepted Answer

OpenAI rolls out “Lockdown Mode” to reduce prompt injection data theft OpenAI has begun rolling out “Lockdown Mode” to ChatGPT, an optional security setting aimed at blocking attackers from stealing data through prompt injection attacks. Prompt injection is a common risk in systems that accept user provided content. In these attacks, malicious instructions are hidden in websites, documents, or other inputs so that an AI assistant follows the attacker’s guidance—potentially exposing sensitive information or extracting data the user intended to keep private. OpenAI’s Lockdown Mode is designed to mitigate that pathway by changing how ChatGPT processes and responds when prompt injection attempts are present. The rollout is positioned as an extra layer of protection rather than something meant for every user. The company has indicated most people do not need to use the feature, suggesting it may involve stricter handling that could reduce some capabilities or flexibility. The feature is also reflected across multiple coverage points, including the idea that it can disable or limit certain functions when enabled, trading convenience and breadth for improved security against…

What is OpenAI Lockdown Mode for prompt injection?

OpenAI rolls out “Lockdown Mode” to reduce prompt-injection data theft