How does OpenAI’s Agents SDK sandboxing work?

Question

Hans Steiner · Accepted Answer

OpenAI updates Agents SDK for safer, in-house agent testing

OpenAI has released an update to its Agents SDK focused on security and operational reliability for agentic AI systems. The new version adds native sandboxing and an in-distribution harness intended to help developers deploy and test agents on “long-horizon” tasks.

Native sandboxing is aimed at limiting what agents can do while they run, so that agent actions are constrained rather than executing with broad access to the surrounding environment. The intent is to reduce the risk surface that comes with autonomous or semi-autonomous workflows, where an agent may make multi-step tool calls over time.

The purpose of the in-distribution harness

The “in-distribution harness” is designed to let developers run agents inside the same overall environment they would use for deployment, but with tooling to support testing and evaluation. That matters because agent behavior can shift when task sequences get longer, when tools are called repeatedly, or when the agent needs to recover from earlier steps. Instead of treating agents as simple chat responses, the update acknowledges that real deployments involve execution over many steps.…
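To make the idea of constrained agent actions concrete, here is a minimal sketch in plain Python. It is not the Agents SDK API, and the answer above does not describe how the SDK implements sandboxing. Every name in it (`Sandbox`, `SandboxViolation`, `run_steps`, the two tools) is hypothetical and chosen only for illustration. It assumes a sandbox is two rules: a tool allowlist and a single directory the agent may touch. It then runs a scripted multi-step sequence and records which steps were blocked, roughly what a test harness for long tasks would log.

```python
import tempfile
from dataclasses import dataclass, field
from pathlib import Path
from typing import Callable, Dict, List, Tuple


class SandboxViolation(Exception):
    """Raised when a tool call falls outside what the sandbox permits."""


@dataclass
class Sandbox:
    """Hypothetical sandbox: one writable directory plus a tool allowlist."""
    root: Path
    allowed_tools: Dict[str, Callable[..., str]] = field(default_factory=dict)

    def resolve(self, relative: str) -> Path:
        # Reject any path that resolves outside the sandbox root,
        # such as "../outside.txt" or an absolute path.
        base = self.root.resolve()
        target = (base / relative).resolve()
        if not target.is_relative_to(base):  # Python 3.9+
            raise SandboxViolation(f"path escapes sandbox: {relative}")
        return target

    def call(self, tool: str, *args: str) -> str:
        # Only tools registered in the allowlist may run.
        if tool not in self.allowed_tools:
            raise SandboxViolation(f"tool not allowed: {tool}")
        return self.allowed_tools[tool](self, *args)


def write_file(sb: Sandbox, name: str, text: str) -> str:
    sb.resolve(name).write_text(text)
    return f"wrote {name}"


def read_file(sb: Sandbox, name: str) -> str:
    return sb.resolve(name).read_text()


def run_steps(sb: Sandbox, steps: List[tuple]) -> List[Tuple[str, str, str]]:
    """Run a scripted sequence of tool calls and record each outcome."""
    log = []
    for tool, *args in steps:
        try:
            log.append((tool, "ok", sb.call(tool, *args)))
        except SandboxViolation as exc:
            # A blocked step is recorded, and the run continues.
            log.append((tool, "blocked", str(exc)))
    return log


def demo() -> List[Tuple[str, str, str]]:
    with tempfile.TemporaryDirectory() as tmp:
        sb = Sandbox(Path(tmp), {"write_file": write_file, "read_file": read_file})
        steps = [
            ("write_file", "notes.txt", "step one"),  # allowed
            ("read_file", "notes.txt"),               # allowed
            ("read_file", "../outside.txt"),          # escapes the root
            ("delete_everything",),                   # not on the allowlist
        ]
        return run_steps(sb, steps)


if __name__ == "__main__":
    for entry in demo():
        print(entry)
```

In this sketch the first two steps succeed and the last two are recorded as blocked rather than executed. A production sandbox would typically enforce isolation at the process or container level instead of relying on in-process checks like these.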
