Generative artificial intelligence heavyweight OpenAI on Thursday previewed an AI agent that can carry out tasks on the web for users, as it seeks to enhance its chatbot amid intensifying competition.
The model underpinning Operator is a Computer-Using Agent (CUA) that combines GPT-4o's vision mode to "see" what's on the user's screen through screenshots with graphical user interfaces (GUIs) that enable Operator to interact with the screen (clicking buttons, typing, scrolling, etc.).
Samsung, Google join forces to tackle the AI boom, facing competition from OpenAI and Apple, redefining innovation in the smartphone market with the Galaxy S25
This development follows the introduction of the o3 series, designed to enhance AI's ability to tackle complex problems through improved reasoning capabilities. The o3 mini model represents a significant leap from its predecessor, o1, by incorporating advanced reasoning skills that allow for step-by-step logical analysis.
The company says the CUA’s reasoning technique, which they call an “inner monologue,” helps the model understand intermediate steps and adapt to unexpected input. Under the hood, CUA takes screenshots of web pages and uses a virtual mouse and keyboard to navigate.
Chinese AI firm DeepSeek announced a new model called R1, which the outfit says is performing "on par" with OpenAI's o1.
Jan 14 (Reuters) - Generative artificial intelligence bellwether OpenAI said on Tuesday that it is introducing a beta feature called Tasks to ChatGPT, signaling the company's foray into the virtual assistant space, competing with Apple's (AAPL.O), opens ...
OpenAI, Oracle, Softbank and MGX are investing a record amount in new AI infrastructure even as China's DeepSeek outperforms on cost.
ChatGPT is OpenAI's extremely useful chatbot for answering questions. Here's how to use the generative AI tool in Apple's Notes app in macOS.
The company announced it was testing advanced reasoning models, o3 and o3 mini, designed to address more complex tasks compared to earlier iterations.
The era of ChatGPT doing stuff for you has arrived.
GENERATIVE artificial intelligence (AI) heavyweight OpenAI on Thursday (Jan 23) previewed an AI agent that can carry out tasks on the web for users, as it seeks to enhance its chatbot amid intensifying competition.