Close Menu
AI News TodayAI News Today

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    AI Trusted Less Than Social Media and Airlines, With Grok Placing Last, Survey Says

    Anthropomorphic sculptures made of fake flowers and neck massagers

    Dairy Queen is putting an AI chatbot in its drive-thrus

    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook X (Twitter) Instagram Pinterest Vimeo
    AI News TodayAI News Today
    • Home
    • Shop
    • AI News
    • AI Reviews
    • AI Tools
    • AI Tutorials
    • Chatbots
    • Free AI Tools
    AI News TodayAI News Today
    Home»Chatbots»Testing suggests Google’s AI Overviews tells millions of lies per hour
    Chatbots

    Testing suggests Google’s AI Overviews tells millions of lies per hour

    By No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Gemini icon and chat bubbles
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Looking up information on Google today means confronting AI Overviews, the Gemini-powered search robot that appears at the top of the results page. AI Overviews has had a rough time since its 2024 launch, attracting user ire over its scattershot accuracy, but it’s getting better and usually provides the right answer. That’s a low bar, though. A new analysis from The New York Times attempted to assess the accuracy of AI Overviews, finding it’s right 90 percent of the time. The flip side is that 1 in 10 AI answers is wrong, and for Google, that means hundreds of thousands of lies going out every minute of the day.

    The Times conducted this analysis with the help of a startup called Oumi, which itself is deeply involved in developing AI models. The company used AI tools to probe AI Overviews with the SimpleQA evaluation, a common test to rank the factuality of generative models like Gemini. Released by OpenAI in 2024, SimpleQA is essentially a list of more than 4,000 questions with verifiable answers that can be fed into an AI.

    Oumi began running its test last year when Gemini 2.5 was still the company’s best model. At the time, the benchmark showed an 85 percent accuracy rate. When the test was rerun following the Gemini 3 update, AI Overviews answered 91 percent of the questions correctly. If you extrapolate this miss rate out to all Google searches, AI Overviews is generating tens of millions of incorrect answers per day.

    The report includes several examples of where AI Overviews went wrong. When asked for the date on which Bob Marley’s former home became a museum, AI Overviews cited three pages, two of which didn’t discuss the date at all. The final one, Wikipedia, listed two contradictory years, and AI Overviews confidently chose the wrong one. The benchmark also prompts models to produce the date on which Yo Yo Ma was inducted into the classical music hall of fame. While AI Overviews cited the organization’s website that listed Ma’s induction, it claimed there’s no such thing as the Classical Music Hall of Fame.

    Googles hour lies millions Overviews suggests tells testing
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleSCOTUS overturns 5th Circuit ruling that told ISP to kick pirates off Internet
    Next Article AirPods Max 2 vs AirPods Max (1st Gen): 13 Differences Compared
    • Website

    Related Posts

    Chatbots

    Dairy Queen is putting an AI chatbot in its drive-thrus

    Chatbots

    Once close enough for an acquisition, Stripe and Airwallex are now going after each other

    Chatbots

    The best cheap phones for 2026

    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    AI Trusted Less Than Social Media and Airlines, With Grok Placing Last, Survey Says

    0 Views

    Anthropomorphic sculptures made of fake flowers and neck massagers

    0 Views

    Dairy Queen is putting an AI chatbot in its drive-thrus

    0 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    AI Tutorials

    Quantization from the ground up

    AI Tools

    David Sacks is done as AI czar — here’s what he’s doing instead

    AI Reviews

    Judge sides with Anthropic to temporarily block the Pentagon’s ban

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    AI Trusted Less Than Social Media and Airlines, With Grok Placing Last, Survey Says

    0 Views

    Anthropomorphic sculptures made of fake flowers and neck massagers

    0 Views

    Dairy Queen is putting an AI chatbot in its drive-thrus

    0 Views
    Our Picks

    Quantization from the ground up

    David Sacks is done as AI czar — here’s what he’s doing instead

    Judge sides with Anthropic to temporarily block the Pentagon’s ban

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Terms & Conditions
    • Privacy Policy
    • Disclaimer

    © 2026 ainewstoday.co. All rights reserved. Designed by DD.

    Type above and press Enter to search. Press Esc to cancel.