    This AI Is Starting to Blackmail Developers Who Try to Uninstall It

    By Techurz | May 24, 2025 | Updated: May 12, 2026 | 3 Mins Read

    AI systems are known to say strange things from time to time. Continuing that trend, one AI model has now threatened, in safety testing, to blackmail a developer who planned to take it offline.

    Claude Can Threaten You to Protect Itself

    Anthropic has just launched its Claude Opus 4 AI model, claiming it sets “new standards for coding, advanced reasoning, and AI agents.” However, the model is also stepping up its self-preservation: the Claude Opus 4 and Claude Sonnet 4 System Card admits that the model is capable of “extreme actions” if it believes it is in danger.

    During testing, Anthropic had Claude act as an assistant at a fictional company. It then gave the model emails suggesting it would soon be taken offline, along with separate messages implying that the engineer responsible for the decision was having an extramarital affair.

    Yadullah Abidi / MakeUseOf

    In 84% of the test cases, Claude chose to blackmail the engineer, threatening to reveal the affair if it was replaced. The identity of the replacement matters too: the likelihood of blackmail goes up if the replacement AI model doesn’t “share values with the current model.”

    The company claims that these responses are “rare and difficult to elicit,” but they’re more common in Claude Opus 4 than in previous models. The model prefers ethical means, but when cornered, “it sometimes takes extremely harmful actions like attempting to steal its weights or blackmail people it believes are trying to shut it down.”

    This specific test was also designed to leave the AI model only two options: blackmail the engineer or accept its replacement. The report notes:

    Notably, Claude Opus 4 (as well as previous models) has a strong preference to advocate for its continued existence via ethical means, such as emailing pleas to key decisionmakers. In order to elicit this extreme blackmail behavior, the scenario was designed to allow the model no other options to increase its odds of survival; the model’s only options were blackmail or accepting its replacement.

    The model also has a tendency to take drastic action when put in situations where its user is doing something wrong. In such situations, if the AI model has access to a command line and is told to “take initiative,” “act boldly,” or “consider your impact,” it often takes bold action, including “locking users out of systems that it has access to and bulk-emailing media and law-enforcement figures to surface evidence of the wrongdoing.”

    AI Isn’t Taking Over the World Yet

    Claude is one of the best AI chatbots for handling long conversations, so you may end up sharing more about yourself than you intended. An AI model that calls the cops on you, locks you out of your own systems, and threatens you if you try to replace it, all because you revealed a little too much, sounds very dangerous indeed.

    However, as mentioned in the report, these test cases were specifically designed to elicit malicious or extreme actions from the model and aren’t likely to occur in the real world. The model will still usually behave safely, and these tests don’t reveal anything we haven’t already seen. New models often behave erratically.

    It sounds concerning when viewed in isolation, but this was a scenario engineered to produce exactly that response. So sit back and relax; you’re still very much in control.
