AI News Today

    Apple working to cram massive Gemini model into iPhone to power new Siri


    It’s impossible to totally avoid generative AI when interacting with technology anymore, but Apple has a bit less of it. That’s not entirely by choice, though. The iPhone maker has delayed the AI-enhanced Siri multiple times since first promising it in 2024, but a deal with Google will merge the iconic assistant with Gemini later this year. As we approach the Worldwide Developers Conference, Apple has been working to bring big AI smarts to the modest processing environment of a smartphone. Apple fans may not like the outcome, though.

    Apple has long crowed about the privacy value of running AI locally, but a new report suggests that despite Apple’s best efforts, the iPhone’s Gemini makeover will lean heavily on Google and Nvidia in the cloud. The Information reports that Apple’s Gemini-infused Siri will run both on-device and in the cloud, an apparent reversal of its privacy-focused preference for local AI.

    With every new chip announcement, we hear about how the silicon has been optimized for AI—even Apple does this with its focus on Neural Engine upgrades. You may think from the grandiose language that smartphones are equipped to handle beefy AI models, but that’s not necessarily the case. In fact, the GPUs in most phones can process more AI tokens than the AI-focused NPUs. Components like Apple’s Neural Engine are designed for contextual, efficient AI processing. Even if phones had faster AI processing, they lack the RAM to keep enormous models in memory.
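
The memory constraint can be made concrete with some rough arithmetic. The sketch below estimates only the space needed to hold a model's weights. The parameter counts are illustrative stand-ins for the article's "a few billion" and "trillions", not published figures for any Apple or Google model, and the helper name `weight_memory_gib` is hypothetical.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed just to store the weights, in GiB.

    Ignores activations, caches, and runtime overhead, so real usage is higher.
    """
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumed, illustrative sizes (not figures from any vendor).
models = {
    "on-device model (3e9 params)": 3e9,
    "cloud model (1e12 params)": 1e12,
}

for name, params in models.items():
    for bits in (16, 8, 4):
        size = weight_memory_gib(params, bits)
        print(f"{name} at {bits}-bit: {size:,.1f} GiB")
```

Under these assumptions, a 3-billion-parameter model needs roughly 5.6 GiB at 16-bit precision and about 1.4 GiB at 4-bit, while a trillion-parameter model needs hundreds of GiB even at 4-bit.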

    Even the largest AI models are still middling assistants, and that makes local AI very challenging. The AI models that run on phones are much smaller, featuring at most a few billion parameters. Compare that to Google’s latest Gemini models, which have trillions of parameters, The Information reports. On-device AI models are also “quantized” to run at lower precision, which makes them faster at some cost to the accuracy of token generation. This all adds up to AIs that feel less smart than their cloud brethren, and even big cloud-based models can be pretty dumb sometimes.
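
To illustrate what "quantized" means here, the following is a minimal sketch of symmetric 8-bit quantization, one common textbook scheme. It is an assumption chosen for illustration, not the method Apple or Google is reported to use, and the function names and the random test weights are hypothetical.

```python
import random


def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float values from the stored integers."""
    return [v * scale for v in quantized]


random.seed(0)
# Made-up weights standing in for one layer of a model.
weights = [random.gauss(0.0, 0.02) for _ in range(1000)]

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# The round trip is lossy: each weight can shift by up to half a step.
max_err = max(abs(a - b) for a, b in zip(weights, restored))
print(f"scale={scale:.6f}, max round-trip error={max_err:.6f}")
```

Each weight is stored as a small integer instead of a 16- or 32-bit float, which saves memory and bandwidth, but the restored values no longer match the originals exactly. That rounding error is the accuracy cost the article describes.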

    The amazing, shrinking Gemini

    Google has versions of Gemini optimized for mobile devices, which it calls Gemini Nano. However, these are designed for powering contextual features like Magic Cue and audio summarization. Siri, on the other hand, is supposed to be a conversational assistant—you talk to it and it does things. That’s a different experience that requires a different kind of model. On Android, Google doesn’t even bother trying to do that locally. Talking to Gemini always goes straight to the cloud.

    © 2026 ainewstoday.co. All rights reserved. Designed by DD.
