    Cloudflare vs. Perplexity: a web scraping war with big implications for AI

    By Techurz, August 6, 2025


    When the web was established several decades ago, it was built on a number of principles. Among them was a key, overarching standard dubbed “netiquette”: Do unto others as you’d want done unto you. It’s a principle that lived on through other companies, including Google, whose motto for a period was “Don’t be evil.”

    The fundamental idea was simple: Act ethically and morally. If someone asked you to stop doing something, you stopped—or at least considered it. But Cloudflare, an IT company that protects millions of websites from hostile internet attacks, has published an eye-opening exposé suggesting that one of the leading AI tools today isn’t following that principle.

    Cloudflare claims Perplexity, an AI-powered “answer engine,” is ignoring websites’ requests not to crawl their content and spoofing its identity to hide that the requests are coming from an AI company. Cloudflare launched its investigation after receiving complaints from customers that Perplexity was ignoring directives in robots.txt files, which websites use to signal whether they want their content indexed by search engines or AI crawlers.
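
For readers unfamiliar with how robots.txt works, here is a minimal sketch using Python's standard-library `urllib.robotparser`. The bot name `ExampleAIBot`, the rules, and the URL are hypothetical and are not taken from Cloudflare's report. The sketch only illustrates that the check is voluntary and keyed to whatever user agent a crawler declares.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one named AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the rules permit `user_agent` to fetch `url`."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/article"  # placeholder URL
    # A crawler that identifies itself honestly is told to stay out.
    print(is_allowed(ROBOTS_TXT, "ExampleAIBot/1.0", url))
    # The same request under a generic browser-style name matches only
    # the wildcard rule, so the file alone cannot stop it.
    print(is_allowed(ROBOTS_TXT, "Mozilla/5.0", url))
```

Nothing in the file enforces the rule. A crawler has to choose to run a check like this and to report its real name, which is why the dispute turns on identity spoofing rather than on the file format.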

    Perplexity’s alleged behavior highlights what happens when the web shifts from being rooted in voluntary agreements to a more hard-nosed business environment, where commercial goals overrule moral considerations.

    “The code of honor around crawling and robots.txt files is a charming remnant from when the web was collaborative and based on community standards,” says Eerke Boiten, a cybersecurity researcher at De Montfort University in the U.K. Cloudflare’s position as a market leader in web protection means that, for now at least, it’s still possible to preserve some remnants of that morality, Boiten says.

    Boiten believes the sense of ethical cooperation online is fading fast, noting that many large AI companies show little regard for where or how they obtain their training data, often operating in murky ethical territory. While he sees OpenAI as generally respectful of the established norms, he’s far less optimistic about others. “Perplexity trying to scrape their way around any defenses feels like it will be the norm rather than the exception,” he says.

    Perplexity’s alleged conduct stands out as particularly bold, especially given that the company is already facing a lawsuit over unauthorized content scraping.

    Dow Jones & Company, publisher of the Wall Street Journal, and the New York Post filed a lawsuit in October 2024, alleging that Perplexity “copies on a massive scale” their content. (The case is ongoing.) The BBC also sent a letter in June to Perplexity CEO Aravind Srinivas, threatening legal action for scraping its content without permission unless the company stops and either compensates for the data already accessed or deletes it entirely. Perplexity told the Financial Times that the BBC’s case was “manipulative and opportunistic” and reflected a “fundamental misunderstanding” of copyright law.

    Perplexity did not respond to Fast Company’s request for comment on this story. But Boiten, for his part, anticipates an escalating arms race between those trying to protect online content from AI-driven web scraping and the companies scraping that content to improve their models. “Cloudflare applying machine learning to spot Perplexity’s patterns, and acknowledging that publication of all this likely means Perplexity will come up with new decoys,” he says.

    Cornell Law professor James Grimmelmann says the legal limits of scraping content without permission—or bypassing robots.txt files—remain unclear, but Cloudflare’s findings could expose Perplexity to more lawsuits.

    “There is a loose judicial consensus that it is okay to scrape sites when their robots.txt files allow it,” says Grimmelmann, “but Perplexity seems determined to fuck around and find out whether the reverse is true.”
