Close Menu
TechurzTechurz

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    What we’re looking for in Startup Battlefield 2026 and how to put your best application forward

    March 30, 2026

    ScaleOps raises $130M to improve computing efficiency amid AI demand

    March 30, 2026

    Qodo raises $70M for code verification as AI coding scales

    March 30, 2026
    Facebook X (Twitter) Instagram
    Trending
    • What we’re looking for in Startup Battlefield 2026 and how to put your best application forward
    • ScaleOps raises $130M to improve computing efficiency amid AI demand
    • Qodo raises $70M for code verification as AI coding scales
    • Elon Musk’s last co-founder reportedly leaves xAI
    • From Moon hotels to cattle herding: 8 startups investors chased at YC Demo Day
    • Aetherflux reportedly raising Series B at $2 billion valuation
    • OpenAI shuts down Sora while Meta gets shut out in court
    • VCs are betting billions on AI’s next wave, so why is OpenAI killing Sora?
    Facebook X (Twitter) Instagram Pinterest Vimeo
    TechurzTechurz
    • Home
    • AI
    • Apps
    • News
    • Guides
    • Opinion
    • Reviews
    • Security
    • Startups
    TechurzTechurz
    Home»Startups»Cloudflare vs. Perplexity: a web scraping war with big implications for AI
    Startups

    Cloudflare vs. Perplexity: a web scraping war with big implications for AI

    TechurzBy TechurzAugust 6, 2025No Comments3 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    PluggedIn Newsletter logo
    Share
    Facebook Twitter LinkedIn Pinterest Email


    When the web was established several decades ago, it was built on a number of principles. Among them was a key, overarching standard dubbed “netiquette”: Do unto others as you’d want done unto you. It’s a principle that lived on through other companies, including Google, whose motto for a period was “Don’t be evil.”

    The fundamental idea was simple: Act ethically and morally. If someone asked you to stop doing something, you stopped—or at least considered it. But Cloudflare, an IT company that protects millions of websites from hostile internet attacks, has published an eye-opening exposé suggesting that one of the leading AI tools today isn’t following that principle.

    Cloudflare claims Perplexity, an AI-powered “answer engine,” is overriding website requests not to crawl their content by spoofing its identity to hide that the requests are coming from an AI company. Cloudflare launched its investigation after receiving complaints from customers that Perplexity was ignoring directives in robots.txt files, which are used by websites to signal whether they want their content indexed by search engines or AI crawlers.

    Perplexity’s alleged behavior highlights what happens when the web shifts from being rooted in voluntary agreements to a more hard-nosed business environment, where commercial goals overrule moral considerations.

    “The code of honor around crawling and robots.txt files is a charming remnant from when the web was collaborative and based on community standards,” says Eerke Boiten, a cybersecurity researcher at De Montfort University in the U.K. Cloudflare’s position as a market leader in web protection means that, for now at least, it’s still possible to preserve some remnants of that morality, Boiten says.

    Boiten believes the sense of ethical cooperation online is fading fast, noting that many large AI companies show little regard for where or how they obtain their training data, often operating in murky ethical territory. While he sees OpenAI as generally respectful of the established norms, he’s far less optimistic about others. “Perplexity trying to scrape their way around any defenses feels like it will be the norm rather than the exception,” he says.

    Perplexity’s alleged conduct stands out as particularly bold, especially given that the company is already facing a lawsuit over unauthorized content scraping.

    Dow Jones Company—the parent of the Wall Street Journal and New York Post—filed a lawsuit in October 2024, alleging that Perplexity “copies on a massive scale” their content. (The case is ongoing.) The BBC also sent a letter in June to Perplexity CEO Aravind Srinivas, threatening legal action for scraping its content without permission unless the company stops and either compensates for the data already accessed or deletes it entirely. Perplexity told the Financial Times that the BBC’s case was “manipulative and opportunistic” and reflected a “fundamental misunderstanding” of copyright law.

    Perplexity did not respond to Fast Company‘s request for comment on this story. But Boiten, for his part, anticipates an escalating arms race between those trying to protect online content from AI-driven web scraping and the companies attempting to do just that to improve their models. “Cloudflare applying machine learning to spot Perplexity’s patterns, and acknowledging that publication of all this likely means Perplexity will come up with new decoys,” he says.

    Cornell Law professor James Grimmelmann says the legal limits of scraping content without permission—or bypassing robots.txt files—remain unclear, but Cloudflare’s findings could expose Perplexity to more lawsuits.

    “There is a loose judicial consensus that it is okay to scrape sites when their robots.txt files allow it,” says Grimmelmann, “but Perplexity seems determined to fuck around and find out whether the reverse is true.”

    The early-rate deadline for Fast Company’s Most Innovative Companies Awards is Friday, September 5, at 11:59 p.m. PT. Apply today.

    Big Cloudflare Implications Perplexity scraping war Web
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous Article5 Stellar Prime Video Movies to Watch This Week
    Next Article How AI amplifies these other tech trends that matter most to business in 2025
    Techurz
    • Website

    Related Posts

    Opinion

    Fundamental raises $255 million Series A with a new take on big data analysis

    February 5, 2026
    Opinion

    AI security startup Outtake raises $40M from Iconiq, Satya Nadella, Bill Ackman and other big names

    January 28, 2026
    Opinion

    Rogue agents and shadow AI: Why VCs are betting big on AI security

    January 19, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    College social app Fizz expands into grocery delivery

    September 3, 20252,288 Views

    A Former Apple Luminary Sets Out to Create the Ultimate GPU Software

    September 25, 202516 Views

    The Reason Murderbot’s Tone Feels Off

    May 14, 202512 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    College social app Fizz expands into grocery delivery

    September 3, 20252,288 Views

    A Former Apple Luminary Sets Out to Create the Ultimate GPU Software

    September 25, 202516 Views

    The Reason Murderbot’s Tone Feels Off

    May 14, 202512 Views
    Our Picks

    What we’re looking for in Startup Battlefield 2026 and how to put your best application forward

    March 30, 2026

    ScaleOps raises $130M to improve computing efficiency amid AI demand

    March 30, 2026

    Qodo raises $70M for code verification as AI coding scales

    March 30, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    © 2026 techurz. Designed by Pro.

    Type above and press Enter to search. Press Esc to cancel.