Techurz

    OpenAI gets caught vibe graphing

    By Techurz, August 9, 2025

    During its big GPT-5 livestream on Thursday, OpenAI showed off a few charts that made the model seem quite impressive — but if you look closely, some graphs were a little bit off.

    In one, ironically showing how well GPT-5 does in “deception evals across models,” the scale is all over the place. For “coding deception,” for example, the chart shown onstage gives GPT-5 with thinking a 50.0 percent deception rate, yet o3’s smaller 47.4 percent score is drawn with a larger bar. OpenAI appears to have the accurate numbers for this chart in its GPT-5 blog post, however, where GPT-5’s deception rate is labeled as 16.5 percent.

    In a chart OpenAI showed onstage, one of GPT-5’s scores is lower than o3’s but is drawn with a bigger bar. In the same chart, o3 and GPT-4o’s scores differ but are shown with equally sized bars. It was bad enough that CEO Sam Altman commented on it, calling it a “mega chart screwup,” though he noted that a correct version is in OpenAI’s blog post.
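    The two problems described above (a smaller score drawn with a bigger bar, and different scores drawn with equal bars) are both cases where the ordering of the bars disagrees with the ordering of the numbers. A minimal sketch of how such a mismatch could be flagged automatically follows. The function name `inconsistent_pairs` and the pixel heights are hypothetical, since no bar measurements have been published; only the 50.0 and 47.4 percent labels come from the chart shown onstage.

```python
from itertools import combinations


def inconsistent_pairs(values, heights):
    """Return label pairs whose drawn bar heights contradict their values.

    A pair is flagged when the comparison of the two values (greater,
    equal, smaller) differs from the comparison of the two bar heights.
    """
    def sign(x):
        return (x > 0) - (x < 0)

    problems = []
    for a, b in combinations(values, 2):
        if sign(values[a] - values[b]) != sign(heights[a] - heights[b]):
            problems.append((a, b))
    return problems


# Percentages as labeled on the onstage "coding deception" chart.
values = {"GPT-5 (thinking)": 50.0, "o3": 47.4}

# Hypothetical bar heights in pixels, chosen only to mirror the reported
# problem (the smaller o3 score drawn with the larger bar).
heights = {"GPT-5 (thinking)": 120, "o3": 300}

print(inconsistent_pairs(values, heights))
```

    With these assumed heights, the check reports the GPT-5 and o3 pair as inconsistent. It would also flag two bars of equal height whose labels differ.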

    An OpenAI marketing staffer also apologized, saying, “We fixed the chart in the blog guys, apologies for the unintentional chart crime.”

    On Friday, in response to a Reddit user asking about the graphs, Altman said that “the numbers here were accurate but we screwed up the bar charts in the livestream overnight; on another slide we screwed up numbers.” He also noted that the blog post and system card were “accurate” and said that “people were working late and were very tired, and human error got in the way. A lot comes together for a livestream in the last hours.”

    It’s still not a great look for the company on its big launch day — especially when it is touting the “significant advances in reducing hallucinations” with its new model.

    Update, August 8th: Added Reddit comment from Altman.
