    Breaking AI News
    Technology & Innovation

    OpenAI Benchmark Tests AI Productivity as CFOs Demand ROI

    By Art Ryan, October 3, 2025

    Artificial intelligence’s credibility in the enterprise now hinges on whether it can perform real professional work at the standard of a trained expert.

    That is the bar chief financial officers are setting as they weigh productivity, cost savings and return on investment. Finance chiefs are under pressure to scrutinize every AI dollar, demanding proof that projects move beyond experiments and into measurable economic value. GDPval, a benchmark introduced by OpenAI, offers a concrete step in that direction by showing where AI is shifting from experimental to economically valuable.

    GDPval is the first large-scale attempt to measure whether frontier AI models can perform professional-grade tasks. It evaluates leading AI models on 1,320 tasks drawn from actual work across 44 occupations in nine industries that together account for $3 trillion in U.S. wages. These aren’t puzzles or tests; they are professional deliverables like financial forecasts, healthcare case analyses, legal memos, and sales presentations. On average, a human expert needed seven hours to complete each task, with an estimated value of nearly $400.
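To put those figures in proportion, the short Python sketch below derives two back-of-the-envelope numbers from them: the average number of tasks per occupation and the implied value of an expert hour. Only the four constants come from the article. The function names are illustrative, and the article's "nearly $400" is rounded up to 400, so the hourly figure should be read as a rough upper bound rather than a reported result.

```python
# Figures quoted in the article; everything derived from them is illustrative.
TOTAL_TASKS = 1320
OCCUPATIONS = 44
AVG_EXPERT_HOURS = 7.0
AVG_TASK_VALUE_USD = 400.0  # assumption: "nearly $400" rounded up to 400


def tasks_per_occupation(total_tasks: int, occupations: int) -> float:
    """Average number of benchmark tasks per occupation."""
    return total_tasks / occupations


def implied_hourly_value(task_value_usd: float, expert_hours: float) -> float:
    """Dollar value of one expert hour implied by the per-task averages."""
    return task_value_usd / expert_hours


if __name__ == "__main__":
    print(tasks_per_occupation(TOTAL_TASKS, OCCUPATIONS))  # 30.0
    print(round(implied_hourly_value(AVG_TASK_VALUE_USD, AVG_EXPERT_HOURS), 2))  # 57.14
```

On these averages the benchmark works out to about 30 tasks per occupation, and each expert hour is valued at a little under $60.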

    What the Benchmark Shows

    When judged blindly against expert outputs, leading models showed near-parity. Claude Opus 4.1 produced deliverables rated equal to or better than human work in 47.6% of cases, particularly excelling at aesthetics like slide layout. GPT-5 led on accuracy: it followed instructions and handled calculations reliably.

    Pairing AI with human oversight also generated measurable returns. In scenarios where professionals reviewed and edited AI outputs, tasks were completed 1.1 to 1.6 times faster and cheaper than when humans worked alone. On average, model-only work still fell short of expert-level consistency, but in hybrid settings, output quality rose by more than 30% compared to unaided AI.
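As a rough illustration of what a 1.1x to 1.6x gain could mean in practice, the sketch below applies that range to the benchmark's average task (seven expert hours, nearly $400). This is a simplification and not a result from the study: it assumes the same factor applies to both time and cost, and it rounds the task value up to $400. The function name `with_speedup` is hypothetical.

```python
# Averages quoted in the article; applying the speedup range to them is an assumption.
BASELINE_HOURS = 7.0
BASELINE_COST_USD = 400.0  # assumption: "nearly $400" rounded up to 400
SPEEDUP_RANGE = (1.1, 1.6)  # "1.1 to 1.6 times faster and cheaper"


def with_speedup(baseline: float, speedup: float) -> float:
    """Scale a baseline time or cost down by a speedup factor."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return baseline / speedup


if __name__ == "__main__":
    for speedup in SPEEDUP_RANGE:
        hours = with_speedup(BASELINE_HOURS, speedup)
        cost = with_speedup(BASELINE_COST_USD, speedup)
        saved = BASELINE_HOURS - hours
        print(f"{speedup}x: {hours:.1f} h, ${cost:.0f}, saves {saved:.1f} h per task")
```

Under these assumptions, a seven-hour task would take roughly 4.4 to 6.4 hours with AI assistance and human review, at a cost of roughly $250 to $364.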

    The benchmark also revealed variation across industries: performance was strongest in finance and professional services tasks, where structured data and defined deliverables dominate, and weaker in healthcare and education, where nuance and contextual judgment mattered more.

    Where Leaders See the Payoff

    This evidence aligns with PYMNTS reporting on how firms are beginning to reconfigure workflows. The CAIO report finds 98% of leaders now expect generative AI to streamline workflows, up from 70% last year. Nearly as many (95%) anticipate sharper decision-making. Similarly, in healthcare, early AI deployments in billing and coding show measurable ROI, but executives consistently cite accuracy and liability as gating factors.

    Outside research supports the trajectory. A National Bureau of Economic Research study found that giving customer service agents access to generative AI boosted productivity by 14% on average, with the largest gains, a 34% improvement, among junior staff. Meanwhile, McKinsey’s analysis continues to place the economic upside of generative AI in a similar range, estimating that the technology could unlock $2.6 trillion to $4.4 trillion annually across 63 use cases.

    The Blind Spots to Be Managed

    GDPval also highlights where AI still falls short. Across models, the most common failure mode was not following instructions. GPT-5’s misses were often cosmetic, like formatting glitches or overly verbose outputs, but about 3% of failures were catastrophic, meaning they could cause serious damage if deployed without oversight, such as giving wrong medical advice or insulting a client. The study notes that these errors remain a limiting factor, even as models approach professional-level performance on many tasks.

    This mirrors PYMNTS coverage of AI “hallucinations” in compliance and payments contexts, where fabricated data or misinterpretations can quickly become regulatory landmines. Still, the trend indicates steady improvement, with each generation closing gaps that once seemed insurmountable.

    Source: https://www.pymnts.com/