Close Menu
TechurzTechurz

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    I’ve tried 3 different smart rings but I keep going back to Apple Watch – here’s why

    September 1, 2025

    You can buy an iPhone 16 Pro for $250 off on Amazon right now – how the deal works

    September 1, 2025

    ‘Cyberpunk 2077’ Is Teasing Something For Three Days From Now

    September 1, 2025
    Facebook X (Twitter) Instagram
    Trending
    • I’ve tried 3 different smart rings but I keep going back to Apple Watch – here’s why
    • You can buy an iPhone 16 Pro for $250 off on Amazon right now – how the deal works
    • ‘Cyberpunk 2077’ Is Teasing Something For Three Days From Now
    • WhatsApp 0-Day, Docker Bug, Salesforce Breach, Fake CAPTCHAs, Spyware App & More
    • 5 days left: Exhibit tables are disappearing for Disrupt 2025
    • Is AI the end of software engineering or the next step in its evolution?
    • Look out, Meta Ray-Bans! These AI glasses just raised over $1M in pre-orders in 3 days
    • How I took control of my email address with a custom domain
    Facebook X (Twitter) Instagram Pinterest Vimeo
    TechurzTechurz
    • Home
    • AI
    • Apps
    • News
    • Guides
    • Opinion
    • Reviews
    • Security
    • Startups
    TechurzTechurz
    Home»Guides»This AI Is Starting to Blackmail Developers Who Try to Uninstall It
    Guides

    This AI Is Starting to Blackmail Developers Who Try to Uninstall It

    TechurzBy TechurzMay 24, 2025No Comments3 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    This AI Is Starting to Blackmail Developers Who Try to Uninstall It
    Share
    Facebook Twitter LinkedIn Pinterest Email


    AI has been known to say something weird from time to time. Continuing with that trend, this AI system is now threatening to blackmail developers who want to remove it from their systems.

    Claude Can Threaten You to Protect Itself

    Anthropic has just launched its Claude Opus 4 AI model, claiming it sets “new standards for coding, advanced reason, and AI agents.” However, the model is also leveling up its own protection, with the Claude 4 and Sonnet 4 System Card report [PDF] admitting that the model is capable of “extreme actions” if it feels it’s in danger.

    During testing, Anthropic made Claude act as an assistant at a fictional company. Then the company handed the AI model emails suggesting it would be taken offline soon, and separate messages implying the engineer responsible for this action was having an extramarital affair.

    Yadullah Abidi / MakeUseOf

    Claude chose to blackmail the engineer, claiming that it would reveal the affair if replaced in 84% of the test cases. It also cares about what AI model is replacing it, with the likelihood of blackmail going up if the replacement AI model doesn’t “share values with the current model.”

    The company claims that these responses are “rare and difficult to elicit,” but they’re more common in Claude Opus 4 than in previous models. The model prefers using ethical ways, but when cornered, “it sometimes takes extremely harmful actions like attempting to steal its weights or blackmail people it believes are trying to shut it down.”

    This specific test was also designed to leave the AI model no choice except to blackmail the engineer. The report notes:

    Notably, Claude Opus 4 (as well as previous models) has a strong preference to advocate for its continued existence via ethical means, such as emailing pleas to key decisionmakers. In order to elicit this extreme blackmail behavior, the scenario was designed to allow the model no other options to increase its odds of survival; the model’s only options were blackmail or accepting its replacement.

    The model also has a tendency to take drastic action when put in situations where its user is doing something wrong. In such situations, if the AI model has access to a command line and is told to “take initiative,” “act boldly,” or “consider your impact,” it often takes bold action, including “locking users out of systems that it has access to and bulk-emailing media and law-enforcement figures to surface evidence of the wrongdoing.”

    AI Isn’t Taking Over the World Yet

    Claude is one of the best AI chatbots for handling big conversations, so you’re likely to spill some unwanted details from time to time. An AI model calling the cops on you, locking you out of your own systems, and threatening you if you try to replace it just because you revealed a little too much about yourself sounds very dangerous indeed.

    However, as mentioned in the report, these test cases were specifically designed to extract malicious or extreme actions from the model and aren’t likely to happen in the real world. It’ll still usually behave safely, and these tests do not reveal something we haven’t already seen. New models often tend to go unhinged.

    Related

    I’ve Ditched ChatGPT for This Superior Alternative: 3 Reasons Why

    ChatGPT was great, but here’s why I’ve switched to something better…

    It sounds concerning when you’re looking at it as an isolated incident, but it’s just one of those conditions engineered to get such a response. So sit back and relax, you’re very much in control still.

    blackmail Developers Starting Uninstall
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleYou’ll be as annoyed as me when you learn how much energy a few seconds of AI video costs
    Next Article Date, Time, And Fight Card Details
    Techurz
    • Website

    Related Posts

    Startups

    Co-founders of Stakt on Starting a Side Hustle Earning $10M in 2025

    August 22, 2025
    Apps

    Developers predict artificial intelligence will do most marketing work as reliance on automation grows deeper

    August 18, 2025
    Guides

    What to Know Before You Use One

    August 17, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Start Saving Now: An iPhone 17 Pro Price Hike Is Likely, Says New Report

    August 17, 20258 Views

    You Can Now Get Starlink for $15-Per-Month in New York, but There’s a Catch

    July 11, 20257 Views

    Non-US businesses want to cut back on using US cloud systems

    June 2, 20257 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Start Saving Now: An iPhone 17 Pro Price Hike Is Likely, Says New Report

    August 17, 20258 Views

    You Can Now Get Starlink for $15-Per-Month in New York, but There’s a Catch

    July 11, 20257 Views

    Non-US businesses want to cut back on using US cloud systems

    June 2, 20257 Views
    Our Picks

    I’ve tried 3 different smart rings but I keep going back to Apple Watch – here’s why

    September 1, 2025

    You can buy an iPhone 16 Pro for $250 off on Amazon right now – how the deal works

    September 1, 2025

    ‘Cyberpunk 2077’ Is Teasing Something For Three Days From Now

    September 1, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    © 2025 techurz. Designed by Pro.

    Type above and press Enter to search. Press Esc to cancel.