News

By OpenAI 's own testing, its newest reasoning models, o3 and o4 -mini, hallucinate significantly higher than o1.
OpenAI delivered advanced ChatGPT reasoning models this month that are more capable than o1, but they also hallucinate more.
OpenAI's reasoning AI models are getting better, but their hallucinating isn't, according to benchmark results.
OpenAI says its latest models, o3 and o4-mini, are its most powerful yet. However, research shows the models also hallucinate ...
However, according to OpenAI’s internal tests, these new o3 and o4-mini reasoning models also hallucinate significantly more ...
Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...
OpenAI's latest AI models tend to make things up — or "hallucinate" — substantially more than earlier versions.
OpenAI’s newly released o3 and o4-mini are some of the smartest AI models to ever be released, but they seem to be suffering ...
OpenAI announced the release of a pair of models, o3 and o4-mini. In announcing them, the company referred to them as “the ...
An OpenAI executive confirmed that it would buy Chrome if Google were forced to sell the browser. But it has interesting ...
OpenAI's new AI models are hallucinating more than their predecessor, as per an internal testing report released by the ...
According to OpenAI’s internal testing, the new o3 model hallucinated in 33% of cases on the company’s PersonQA benchmark.