Close Menu
    What's Hot
    Technology & Innovation

    Google I/O 2026: Sundar Pichai Announces Agentic Gemini Era

    By Art RyanMay 20, 20260

    Officially, Google I/O 2026 marked the dawn of a new age in artificial intelligence. Among…

    Google and Blackstone $5B AI Cloud Venture to Rival CoreWeave

    May 20, 2026

    Moment Raises $78M to Revolutionize Wealth Management

    May 20, 2026

    Thailand and Alipay+ to Accelerate AI-Driven Tourism Collaboration

    May 19, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Breaking AI News
    Wednesday, May 20
    • Home
    • Events
    • Videos
      • Machine Can Think Summit 2026
      • Step Dubai Conference 2026
    • Technology & Innovation

      Google I/O 2026: Sundar Pichai Announces Agentic Gemini Era

      May 20, 2026

      Google and Blackstone $5B AI Cloud Venture to Rival CoreWeave

      May 20, 2026

      Moment Raises $78M to Revolutionize Wealth Management

      May 20, 2026

      Thailand and Alipay+ to Accelerate AI-Driven Tourism Collaboration

      May 19, 2026

      AI Everything Kenya 2026 Officially Kicks Off Today in Nairobi

      May 19, 2026
    • Business & Marketing

      Google and Blackstone $5B AI Cloud Venture to Rival CoreWeave

      May 20, 2026

      Moment Raises $78M to Revolutionize Wealth Management

      May 20, 2026

      Dubai Holding Partners With Microsoft to Accelerate AI Adoption

      May 19, 2026

      Dust Raises $40M Series B to Scale AI Enterprise Workspaces

      May 19, 2026

      Baidu Beats Estimates on Agentic AI Strategy

      May 19, 2026
    • Industry Applications

      Moment Raises $78M to Revolutionize Wealth Management

      May 20, 2026

      Dubai Holding Partners With Microsoft to Accelerate AI Adoption

      May 19, 2026

      Dubai GDRFA Unveils AI-Powered System to Transform Services

      May 19, 2026

      UAE Rolls Out Massive Agentic AI Training for 80,000 Employees

      May 19, 2026

      NextEra Dominion $67B Merger Shows AI Power Demand

      May 19, 2026
    • Trends & Insights

      Google I/O 2026: Sundar Pichai Announces Agentic Gemini Era

      May 20, 2026

      NextEra Dominion $67B Merger Shows AI Power Demand

      May 19, 2026

      Baidu Beats Estimates on Agentic AI Strategy

      May 19, 2026

      Ghana AI Healthcare Programme for Quality Healthcare Access

      May 18, 2026

      Israel National AI Strategy Drives AI Talent and Startup Innovation

      May 18, 2026
    • AI Travel Technology News

      New EU AI Border System May Bring Travel Delays for US Tourists

      May 18, 2026

      AI-Powered Apps Drive Colombia’s Birdwatching Tourism Growth

      May 18, 2026

      Japan Expands e-Gates to Ease Travel for Foreign Visitors

      May 15, 2026

      Vietnam’s $300B Digital Tech Push May Boost AI Travel by 2030

      May 15, 2026

      Sabre IQ AI Travel Platform: Redefining Travel Management

      May 14, 2026
    Breaking AI News
    Home » OpenAI Says AI Hallucinations Are Systemic, Not a Bug
    Technology & Innovation

    OpenAI Says AI Hallucinations Are Systemic, Not a Bug

    Art RyanBy Art RyanSeptember 10, 2025No Comments3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Large language models don’t just make mistakes. They sometimes invent answers with striking confidence. A new paper from OpenAI researchers Adam Tauman Kalai, Ofir Nachum, and colleagues argues that these “hallucinations” are not mysterious glitches but predictable byproducts of the way today’s artificial intelligence (AI) systems are trained and tested.

    The report, “Why Language Models Hallucinate,” traces the problem to two root causes: the way models learn language during pretraining, and the way they are judged during evaluation. Together, these forces create statistical pressure to guess rather than to acknowledge uncertainty.

    The first stage, pretraining, exposes a model to massive datasets. The researchers argue that even if those datasets were perfect, hallucinations would still occur because the training objective — predicting the next word — maps onto the same error patterns seen in binary classification. For example, if a model sees a celebrity’s birthday once in training, it cannot reliably reproduce it later. As the authors explain, hallucinations are simply “errors in binary classification” magnified by the task of generating fluent language.

    The paper illustrates this with striking cases. When asked the birthday of one of the paper’s authors, Adam Tauman Kalai, an open-source model confidently supplied three different but incorrect dates, even though the correct answer was not in its training set.

    In another test, when asked to count the number of Ds in the word DEEPSEEK, several models produced answers ranging from 2 to 7, none of them correct. These examples, the authors argue, show how models “fill in the blanks” with plausible guesses when they lack reliable information or when the task itself is poorly represented in training.

    Why Post-Training Keeps Errors Alive

    The second stage, post-training, is supposed to refine models and reduce errors. Yet the paper argues that evaluation systems — benchmarks and leaderboards — end up encouraging bluffing instead of honesty. Most widely used tests reward correct answers but assign zero points to uncertainty or an “I don’t know” response. That means a model that always guesses will consistently score better than one that admits gaps in its knowledge.

    As the authors put it: “Optimizing models for these benchmarks may therefore foster hallucinations. Humans learn the value of expressing uncertainty outside of school, in the school of hard knocks. On the other hand, language models are primarily evaluated using exams that penalize uncertainty. Therefore, they are always in ‘test-taking’ mode.”

    This framing helps explain why hallucinations remain stubborn even in the most advanced systems. Improvements in architecture, scale and alignment don’t change the fact that the scoring rules push models toward overconfidence.

    The paper concludes that the solution isn’t another hallucination test but a redesign of the evaluation system itself. By modifying benchmarks to give partial credit for uncertainty, much like standardized exams that penalize wrong guesses, developers can realign incentives. The authors suggest explicit confidence thresholds, where models only answer if they are more than, say, 75% sure.

    For professionals in finance, payments and other sectors where accuracy is nonnegotiable, the takeaway is sobering. Hallucinations aren’t random quirks; they are systemic. They can also be expensive for businesses and consumers alike. Insurance companies, earlier this year, started covering AI hallucination mishaps.

    Unless the field changes how it measures performance, AI systems will continue to “sound right” while sometimes being wrong. But with better scoring, the researchers argue, AI could be nudged toward becoming a more trustworthy partner in high-stakes decision-making.

    Source: https://www.pymnts.com/
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Art Ryan

    Related Posts

    Google I/O 2026: Sundar Pichai Announces Agentic Gemini Era

    May 20, 2026

    Google and Blackstone $5B AI Cloud Venture to Rival CoreWeave

    May 20, 2026

    Moment Raises $78M to Revolutionize Wealth Management

    May 20, 2026

    Comments are closed.

    Latest News

    Google I/O 2026: Sundar Pichai Announces Agentic Gemini Era

    May 20, 2026

    Google and Blackstone $5B AI Cloud Venture to Rival CoreWeave

    May 20, 2026

    Moment Raises $78M to Revolutionize Wealth Management

    May 20, 2026

    Thailand and Alipay+ to Accelerate AI-Driven Tourism Collaboration

    May 19, 2026
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram LinkedIn YouTube Spotify Reddit Snapchat Threads

    AI University

    • Global Universities
    • Universities in Africa
    • Universities in Asia
    • Universities in Europe
    • Universities in Latin America
    • Universities in Middle East
    • Universities in North America
    • Universities in Oceania

    AI Tools & Apps Directory

    • AI Productivity Tools
    • AI Coding Tools
    • AI Voice Tools
    • AI Video Tools
    • AI Image Generators
    • AI Writing Tools

    Info

    • Home
    • About Us
    • AI Organizations & Associations
    • Contact Us
    • Cookie Policy
    • Copyright Policy
    • Disclaimer
    • Editorial Policy
    • Terms and Conditions

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 Breaking AI News.
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Sign Up

    Want to stay ahead In Artificial Intelligence?

     Sign up now and get exclusive breaking AI news and special updates—FREE!