What happened
Perplexity scraping controversy is at the center of this update. Cloudflare, a leading internet infrastructure provider, publicly accused Perplexity of crawling and scraping websites despite those sites implementing technical blocks to forbid AI scraping. This means Perplexity allegedly bypassed explicit instructions from website owners who aimed to protect their content from unauthorized data collection.
Why it matters
This situation underscores pressing issues around how AI companies source training data. The quality and legality of data are critical to AI product development and public trust. If AI firms disregard website owners’ restrictions, it could spark legal challenges, damage reputations, and provoke calls for stricter regulation. For Perplexity, a relatively new competitor in AI-powered search, such controversies risk undermining its credibility in a space dominated by giants like Google and OpenAI.
Context
Perplexity aims to disrupt traditional search by integrating AI-powered answers, competing with ChatGPT and Google Search. Accessing comprehensive and authoritative web data is vital to its service quality. However, many publishers have technical measures to prevent scraping to protect copyrights and control content distribution. Cloudflare’s detection carries weight since it supports security for millions of websites.
Expected impact
This revelation could trigger increased scrutiny on AI data collection methods, encouraging clearer industry standards or regulatory frameworks. It may also influence how startups balance aggressive data acquisition with respect for content owners. Perplexity might face reputational and operational consequences depending on the fallout.
What we still do not know
Details on the scale of Perplexity’s scraping, their internal policies, and whether formal investigations will ensue remain undisclosed. How other AI companies manage similar challenges is also unclear.
Related coverage: AI Chronicle analysis and updates.

Apple Gains Full Access to Google’s Gemini AI to Enhance Siri with Lightweight On-Device Models
Global AI Show Abu Dhabi 2025 Highlights Innovation and Leadership in Artificial Intelligence
Google Launches TranslateGemma: Advanced AI Translation Across 55 Languages
Lego Launches Smart Bricks Featuring AI-Driven Interaction Without Screens