OpenAI’s new Operator AI agent can do things on the web for you

Jan 24, 2025 01:00 AM - 2 weeks ago 20858

OpenAI is releasing a “research preview” of an AI supplier called Operator that tin “go to the web to execute tasks for you,” according to a blog post. “Using its ain browser, it tin look astatine a webpage and interact pinch it by typing, clicking, and scrolling,” OpenAI says. It’s launching first successful the US for subscribers of OpenAI’s $200 per period ChatGPT Pro tier.

Operator relies a “Computer-Using Agent” exemplary that combines GPT-4o’s imagination capabilities pinch “advanced reasoning done reinforcement learning” to beryllium capable to interact pinch GUIs, OpenAI says. “Operator tin ‘see’ (through screenshots) and ‘interact’ (using each the actions a rodent and keyboard allow) pinch a browser, enabling it to return action connected the web without requiring civilization API integrations,” according to OpenAI.

Operator tin usage reasoning to “self-correct,” and if it gets stuck, it will springiness the personification control. It will besides inquire the personification to return complete erstwhile a website asks for delicate accusation for illustration login credentials and “should” inquire for a personification to o.k. actions for illustration sending an email. OpenAI besides says that Operator has been designed to “refuse harmful requests and artifact disallowed content.”

OpenAI says that it’s collaborating pinch companies specified arsenic DoorDash, Instacart, OpenTable, Priceline, StubHub, Thumbtack, Uber truthful that Operator “addresses real-world needs while respecting established norms.” But the institution cautions that not everything mightiness activity arsenic you expect conscionable yet; the instrumentality presently has problems pinch “complex interfaces for illustration creating slideshows aliases managing calendars.”

Down the line, OpenAI says it plans to bring Operator to Plus, Team, and Enterprise users and “integrate these capabilities into ChatGPT.”

More