How does OpenAI Lockdown Mode mitigate prompt injection?

Question

Hans Steiner · Accepted Answer

OpenAI’s Lockdown Mode: limiting features against prompt injection OpenAI has rolled out “Lockdown Mode,” an optional security setting aimed at reducing the risk from prompt injection attacks. Prompt injection is a class of attack where malicious instructions are embedded in content that a model later follows—potentially steering the system to reveal information, take unsafe actions, or ignore higher level constraints. In the coverage, Lockdown Mode is described as a protection that works by limiting some features . The intent is to shrink the attack surface: if the model is configured to have fewer capabilities or fewer interactions during a request, there are fewer pathways for an injected instruction to successfully manipulate behavior. The story emphasizes two practical points for users: It’s optional , implying most users don’t need the strictest settings all the time. The feature is targeted at advanced protection against prompt injection rather than general chatbot “safety.” For organizations and developers, the release matters because it signals a concrete product level response to an escalating security problem. Prompt injection has become a key risk wherever LLMs can be…