Close Menu
AI News TodayAI News Today

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Today’s NYT Wordle Hints, Answer and Help for May 31 #1807

    AI grifters are creating fake Black people to sell Shein junk

    Pebblebee’s Halo watches my back and my belongings

    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook X (Twitter) Instagram Pinterest Vimeo
    AI News TodayAI News Today
    • Home
    • Shop
    • AI News
    • AI Reviews
    • AI Tools
    • AI Tutorials
    • Chatbots
    • Free AI Tools
    AI News TodayAI News Today
    Home»AI Tutorials»“a modest but tangible improvement”
    AI Tutorials

    “a modest but tangible improvement”

    By No Comments4 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    “a modest but tangible improvement”
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Claude Opus 4.8: “a modest but tangible improvement”

    28th May 2026

    Anthropic shipped Claude Opus 4.8 today. My favourite thing about it is this note in the release announcement:

    Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost.

    It’s so refreshing to see an AI lab honestly describe a release as a minor incremental improvement over the previous model!

    Honesty seems to be a theme. Here’s my other favorite note from that announcement:

    One of the most prominent improvements in Opus 4.8 is its honesty. We train all our models to be honest—for instance, to avoid making claims that they can’t support. But a general problem with AI models is that they sometimes jump to conclusions, confidently claiming to have made progress in their work despite the evidence being thin. Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. This is borne out in our evaluations, which show that Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.

    That linked system card includes the following:

    Claude Opus 4.8 had the lowest incorrect-rate of the six models on every benchmark—the most direct measure of factual hallucination. It achieved this mainly by abstaining on questions about which it was uncertain rather than by answering more questions correctly.

    Model characteristics

    Not much has changed since 4.7.

    It’s priced the same as Opus 4.5/4.6/4.7—$5/million input and $25 per million output. “Fast mode” is twice that price, which is a significant reduction from their previous models—fast mode on 4.6/4.7 remains at $30/$150. Note that fast mode is only available to organizations that are part of the research preview, “Contact your account manager to request access”.

    Both the reliable knowledge cutoff and the training data cutoff are January 2026, the same as for 4.7.

    The context window is still 1,000,000 tokens, and the max output is 128,000 tokens.

    The What’s new in Claude Opus 4.8 document has some of the more interesting details. These caught my eye:

    Mid-conversation system messages. Claude Opus 4.8 accepts role: "system" messages immediately after a user turn in the messages array (subject to placement rules). This lets you append updated instructions later in a long-running conversation without restating the full system prompt, which preserves prompt cache hits on the earlier turns and reduces input cost on agentic loops.

    See also this update to the Anthropic Python SDK. Being able to steer the system prompt mid-conversation sounds really powerful. I was worried this would be incompatible with the abstraction provided by my own LLM library, which expects a single system prompt per conversation… but it turns out my recent redesign should handle that just fine.

    Lower prompt cache minimum. The minimum cacheable prompt length on Claude Opus 4.8 is 1,024 tokens, lower than on Claude Opus 4.7.

    I checked and 4.7’s minimum was 4,096.

    And some pelicans

    Here are pelicans riding bicycles for all five thinking levels, low, medium, high, xhigh, and max.

    This time I ran them using the LLM CLI, exported the logs to Markdown and then had Claude Opus 4.8 build me an HTML tool that could render that Markdown with the svg fenced code blocks displayed as SVGs on the page.

    This is the max one—it’s clearly the best, but it did take 25 input, 17,167 output tokens for a total cost of 43 cents!

    improvement modest tangible
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleMicrosoft 365 Copilot gets a speed boost and cleaner design
    Next Article US healthcare still stupidly expensive, with pathetic outcomes, study finds
    • Website

    Related Posts

    AI Tutorials

    Take Google’s vibe coded I/O 2026 quiz

    AI Tutorials

    Students prototype learning tools with AI at University of Waterloo

    AI Tutorials

    Release: datasette 1.0a31

    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Today’s NYT Wordle Hints, Answer and Help for May 31 #1807

    0 Views

    AI grifters are creating fake Black people to sell Shein junk

    0 Views

    Pebblebee’s Halo watches my back and my belongings

    0 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    AI Tutorials

    Quantization from the ground up

    AI Tools

    David Sacks is done as AI czar — here’s what he’s doing instead

    AI Reviews

    Judge sides with Anthropic to temporarily block the Pentagon’s ban

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Today’s NYT Wordle Hints, Answer and Help for May 31 #1807

    0 Views

    AI grifters are creating fake Black people to sell Shein junk

    0 Views

    Pebblebee’s Halo watches my back and my belongings

    0 Views
    Our Picks

    Quantization from the ground up

    David Sacks is done as AI czar — here’s what he’s doing instead

    Judge sides with Anthropic to temporarily block the Pentagon’s ban

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Terms & Conditions
    • Privacy Policy
    • Disclaimer

    © 2026 ainewstoday.co. All rights reserved. Designed by DD.

    Type above and press Enter to search. Press Esc to cancel.