    Cloudflare vs. Perplexity: a web scraping war with big implications for AI

    By Techurz, August 6, 2025 (updated May 11, 2026)


    When the web was established several decades ago, it was built on a number of principles. Among them was a key, overarching standard dubbed “netiquette”: Do unto others as you’d want done unto you. It’s a principle that lived on at companies such as Google, whose motto for a period was “Don’t be evil.”

    The fundamental idea was simple: Act ethically and morally. If someone asked you to stop doing something, you stopped—or at least considered it. But Cloudflare, an IT company that protects millions of websites from hostile internet attacks, has published an eye-opening exposé suggesting that one of the leading AI tools today isn’t following that principle.

    Cloudflare claims Perplexity, an AI-powered “answer engine,” is overriding website requests not to crawl their content by spoofing its identity to hide that the requests are coming from an AI company. Cloudflare launched its investigation after receiving complaints from customers that Perplexity was ignoring directives in robots.txt files, which are used by websites to signal whether they want their content indexed by search engines or AI crawlers.
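To make the mechanism concrete, here is a minimal sketch of how a well-behaved crawler might consult robots.txt, using Python's standard `urllib.robotparser`. The robots.txt contents, the `ExampleAIBot` name, the `is_allowed` helper, and the example.com URL are all hypothetical and are not taken from Cloudflare's report. The sketch shows only that the rules are keyed on the user agent a crawler declares for itself, so a request presenting a generic browser identity falls under the wildcard rule instead.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration only: it blocks one named AI
# crawler and allows everyone else. "ExampleAIBot" is an invented name.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# A crawler that honestly declares itself is told to stay out.
declared = is_allowed(ROBOTS_TXT, "ExampleAIBot", "https://example.com/article")

# The same request under a generic browser identity matches only the
# wildcard rule, so nothing in robots.txt itself stops it.
generic = is_allowed(ROBOTS_TXT, "Mozilla/5.0", "https://example.com/article")

print(declared, generic)
```

Nothing here enforces anything: robots.txt is a request that the crawler chooses to honor, which is why a disguised identity defeats it.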

    Perplexity’s alleged behavior highlights what happens when the web shifts from being rooted in voluntary agreements to a more hard-nosed business environment, where commercial goals overrule moral considerations.

    “The code of honor around crawling and robots.txt files is a charming remnant from when the web was collaborative and based on community standards,” says Eerke Boiten, a cybersecurity researcher at De Montfort University in the U.K. Cloudflare’s position as a market leader in web protection means that, for now at least, it’s still possible to preserve some remnants of that morality, Boiten says.

    Boiten believes the sense of ethical cooperation online is fading fast, noting that many large AI companies show little regard for where or how they obtain their training data, often operating in murky ethical territory. While he sees OpenAI as generally respectful of the established norms, he’s far less optimistic about others. “Perplexity trying to scrape their way around any defenses feels like it will be the norm rather than the exception,” he says.

    Perplexity’s alleged conduct stands out as particularly bold, especially given that the company is already facing a lawsuit over unauthorized content scraping.

    Dow Jones & Company and NYP Holdings, the News Corp subsidiaries that publish the Wall Street Journal and the New York Post, filed a lawsuit in October 2024, alleging that Perplexity “copies on a massive scale” from their content. (The case is ongoing.) The BBC also sent a letter in June 2025 to Perplexity CEO Aravind Srinivas, threatening legal action over the scraping of its content without permission unless the company stops and either pays for the data already accessed or deletes it. Perplexity told the Financial Times that the BBC’s case was “manipulative and opportunistic” and reflected a “fundamental misunderstanding” of copyright law.

    Perplexity did not respond to Fast Company’s request for comment on this story. But Boiten, for his part, anticipates an escalating arms race between those trying to protect online content from AI-driven web scraping and the companies scraping that content to improve their models. “Cloudflare applying machine learning to spot Perplexity’s patterns, and acknowledging that publication of all this likely means Perplexity will come up with new decoys,” he says.

    Cornell Law professor James Grimmelmann says the legal limits of scraping content without permission—or bypassing robots.txt files—remain unclear, but Cloudflare’s findings could expose Perplexity to more lawsuits.

    “There is a loose judicial consensus that it is okay to scrape sites when their robots.txt files allow it,” says Grimmelmann, “but Perplexity seems determined to fuck around and find out whether the reverse is true.”
