LOGO

OpenAI Upgrades AI Model for Operator Agent

May 23, 2025
OpenAI Upgrades AI Model for Operator Agent

OpenAI Enhances Operator with Advanced AI Model

OpenAI is implementing an upgrade to the AI model that drives Operator, its autonomous AI agent. This agent is designed to independently navigate the internet and utilize specific software applications within a cloud-based virtual machine to address user requests.

Transition to the o3 Model

The upcoming update will integrate a model built upon o3, representing a recent advancement within OpenAI’s o series of “reasoning” models. Previously, Operator functioned using a customized version of GPT-4o.

Evaluations across numerous benchmarks demonstrate that o3 is a significantly more capable model, especially in areas requiring mathematical computation and logical reasoning.

API Differences

“A transition is underway, replacing the current GPT‑4o-based model within Operator with one leveraging OpenAI o3,” OpenAI stated in a recent blog post. The API version of Operator, however, will continue to be powered by 4o.

The Rise of AI Agents

Operator is part of a growing trend of agentic tools being developed and released by AI companies. There is considerable competition to create highly sophisticated agents capable of executing tasks with minimal human oversight.

Competitive Landscape

Google provides a “computer use” agent through its Gemini API, offering similar web browsing and action-taking capabilities. They also have a consumer-focused agent named Mariner. Furthermore, Anthropic’s models can also handle computer-based tasks, including file management and web navigation.

Enhanced Safety Features

According to OpenAI, the new Operator model, designated o3 Operator, has undergone “fine-tuning with supplementary safety data for computer usage.” This includes datasets specifically designed to refine the model’s decision-making process regarding confirmations and refusals.

Safety Evaluation Results

OpenAI has published a technical report detailing o3 Operator’s performance on specific safety assessments. The report indicates that, compared to the GPT-4o Operator model, o3 Operator exhibits a reduced tendency to decline “illicit” requests and is less prone to searching for sensitive personal information.

It is also less vulnerable to prompt injection, a type of AI security exploit, as outlined in the technical report.

Safety and Coding Access

o3 Operator employs the same multi-layered safety approach utilized in the 4o version of Operator,” OpenAI clarified in its blog post. Despite inheriting o3’s coding abilities, it does not possess inherent access to a coding environment or terminal.

#openai#operator agent#ai model#ai upgrade#artificial intelligence