Report finds newer reasoning models hallucinate nearly half the time while experts warn of unresolved flaws, deliberate deception and a long road to human-level AI reliability
Just a feeling, but from anecdotal experience the initial release was very good; they quickly realized how powerful a tool it was for the average person, and now they've deliberately dumbed it down in many ways.
Agreed. There was a time when it worked impressively well, but it's become increasingly lazy, forgetful, and confidently wrong, even missing obvious, explicit prompts. If you're using it thoughtfully as an augment, fine. But if you're relying on it blindly, it's risky.
That said, in my experience, Anthropic and OpenAI are still miles ahead. Perplexity had me hooked for a while, but its results have nosedived lately. I know they tune their own model on top of OpenAI and DeepSeek rather than building a true model of their own, but still, whatever they're doing could use some undoing.
They had to add all the safeguards, which also nerfed it.