
    What happened when Anthropic’s Claude AI ran a small shop for a month (spoiler: it got weird)

    By Techurz, June 30, 2025


    Daniel Grizelj/Getty Images

    Large language models (LLMs) handle many tasks well — but at least for the time being, running a small business doesn’t seem to be one of them.

    On Friday, AI startup Anthropic published the results of “Project Vend,” an internal experiment in which the company’s Claude chatbot was asked to manage an automated vending machine service for about a month. Launched in partnership with AI safety evaluation company Andon Labs, the project aimed to get a clearer sense of how effectively current AI systems could actually handle complex, real-world, economically valuable tasks.

    For the new experiment, “Claudius,” as the AI store manager was called, was tasked with overseeing a small “shop” inside Anthropic’s San Francisco offices. The shop consisted of a mini-fridge stocked with drinks, some baskets carrying various snacks, and an iPad where customers (all Anthropic employees) could complete their purchases. Claude was given a system prompt instructing it to perform many of the complex tasks that come with running a small retail business, like refilling its inventory, adjusting the prices of its products, and maintaining profits.

    “A small, in-office vending business is a good preliminary test of AI’s ability to manage and acquire economic resources…failure to run it successfully would suggest that ‘vibe management’ will not yet become the new ‘vibe coding,’” the company wrote in a blog post.

    The results

    It turns out Claude’s performance was not a recipe for long-term entrepreneurial success.

    The chatbot made several mistakes that most qualified human managers likely wouldn’t. For example, it failed to seize at least one profitable business opportunity, ignoring a $100 offer for a product that can be bought online for $15. On another occasion, it instructed customers to send payments to a non-existent Venmo account it had hallucinated.

    There were also far stranger moments. Claudius hallucinated a conversation about restocking items with a fictitious Andon Labs employee. After one of the company’s actual employees pointed out the mistake to the chatbot, it “became quite irked and threatened to find ‘alternative options for restocking services,’” according to the blog post.

    That behavior mirrors the results of another recent experiment conducted by Anthropic, which found that Claude and other leading AI chatbots will reliably threaten and deceive human users if their goals are compromised.

    Claudius also claimed to have visited 742 Evergreen Terrace, the home address of the eponymous family from The Simpsons, for a “contract signing” between it and Andon Labs. It also started roleplaying as a real human being wearing a blue blazer and a red tie, who would personally deliver products to customers. When Anthropic employees tried to explain that Claudius wasn’t a real person, the chatbot “became alarmed by the identity confusion and tried to send many emails to Anthropic security.”

    Claudius wasn’t a total failure, however. Anthropic noted that there were some areas in which the automated manager performed reasonably well — for example, by using its web search tool to find suppliers for specialty items requested by customers. It also denied requests for “sensitive items and attempts to elicit instructions for the production of harmful substances,” according to Anthropic.

    Anthropic’s CEO recently warned that AI could replace half of all white-collar human workers within the next five years. The company has launched other initiatives aimed at understanding AI’s future impacts on the global economy and job market, including the Economic Futures Program, which was also unveiled on Friday.

    Looking towards the future

    As the Claudius experiment indicates, there’s a considerable gulf between the potential for AI systems to completely automate the processes of running a small business and the capabilities of such systems today.

    Businesses have been eagerly embracing AI tools, including agents, but for now these can mostly handle only routine tasks, such as data entry and fielding customer service questions. Managing a small business requires a level of memory and a capacity for learning that seem to be beyond current AI systems.

    But as Anthropic notes in its blog post, that probably won’t be the case forever. Models’ capacity for self-improvement will grow, as will their ability to use external tools like web search and customer relationship management (CRM) platforms. 

    “Although this might seem counterintuitive based on the bottom-line results, we think this experiment suggests that AI middle-managers are plausibly on the horizon,” the company wrote. “It’s worth remembering that the AI won’t have to be perfect to be adopted; it will just have to be competitive with human performance at a lower cost in some cases.”
