Close Menu
AI News TodayAI News Today

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Today’s NYT Connections Hints, Answers for May 11 #1065

    Get ready for the whisper-filled office of the future

    Today’s NYT Strands Hints, Answer and Help for May 11 #799

    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook X (Twitter) Instagram Pinterest Vimeo
    AI News TodayAI News Today
    • Home
    • Shop
    • AI News
    • AI Reviews
    • AI Tools
    • AI Tutorials
    • Chatbots
    • Free AI Tools
    AI News TodayAI News Today
    Home»AI Reviews»Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts
    AI Reviews

    Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

    By No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    The Claude logo is displayed on a smartphone screen placed on a reflective surface onto which a multitude of Claude logos are projected.
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

    Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

    Apparently Anthropic has done more work around that behavior, claiming in a post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

    The company went into more detail in a blog post stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

    What accounts for the difference? The company said it found that “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.”

    Related, Anthropic said that it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.”

    “Doing both together appears to be the most effective strategy,” the company said.

    Techcrunch event

    San Francisco, CA
    |
    October 13-15, 2026

    Anthropic attempts blackmail Claudes Evil portrayals Responsible
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleLa Liga Soccer: Stream Barcelona vs. Real Madrid Live
    Next Article The Bastl Kalimba is a wild synth that thinks it’s a thumb piano
    • Website

    Related Posts

    AI Reviews

    Today’s NYT Connections Hints, Answers for May 11 #1065

    AI Reviews

    Today’s NYT Strands Hints, Answer and Help for May 11 #799

    AI Reviews

    La Liga Soccer: Stream Barcelona vs. Real Madrid Live

    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Today’s NYT Connections Hints, Answers for May 11 #1065

    0 Views

    Get ready for the whisper-filled office of the future

    0 Views

    Today’s NYT Strands Hints, Answer and Help for May 11 #799

    0 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    AI Tutorials

    Quantization from the ground up

    AI Tools

    David Sacks is done as AI czar — here’s what he’s doing instead

    AI Reviews

    Judge sides with Anthropic to temporarily block the Pentagon’s ban

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Today’s NYT Connections Hints, Answers for May 11 #1065

    0 Views

    Get ready for the whisper-filled office of the future

    0 Views

    Today’s NYT Strands Hints, Answer and Help for May 11 #799

    0 Views
    Our Picks

    Quantization from the ground up

    David Sacks is done as AI czar — here’s what he’s doing instead

    Judge sides with Anthropic to temporarily block the Pentagon’s ban

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Terms & Conditions
    • Privacy Policy
    • Disclaimer

    © 2026 ainewstoday.co. All rights reserved. Designed by DD.

    Type above and press Enter to search. Press Esc to cancel.