What is Common Crawl?

Common Crawl is a nonprofit organization that provides free web crawl data for research and AI training.

Why did Digital Content Next send a cease and desist letter?

The letter alleges Common Crawl violates copyright by scraping and distributing publisher content without authorization.

Which publishers are represented by Digital Content Next?

Digital Content Next represents major U.S. digital publishers including The New York Times, The Wall Street Journal, and The Washington Post.

US Publishers Demand Common Crawl Stop Scraping

Digital Content Next sent a cease and desist letter to Common Crawl Foundation demanding it stop scraping publisher content.

June 10, 2026 1 min read 114 views Source: searchenginejournal.com

Digital Content Next, a trade body representing U.S. digital publishers, has sent a cease and desist letter to the Common Crawl Foundation, demanding it stop scraping publisher content and remove material already in its datasets, according to a report by Reuters on June 9, 2026.

The letter, dated June 8, 2026, alleges that Common Crawl's web crawling activities violate copyright laws by systematically collecting and distributing copyrighted content without authorization. Common Crawl, a nonprofit that provides free web crawl data for research and AI training, has not yet publicly responded to the letter as of June 10, 2026.

This action follows growing tensions between content creators and AI developers over the use of scraped data for training large language models. Digital Content Next represents major publishers including The New York Times, The Wall Street Journal, and The Washington Post.

The dispute highlights the ongoing legal and ethical debates around web scraping for AI training, with publishers seeking compensation or opt-out mechanisms for their content. Common Crawl's datasets have been widely used by companies like OpenAI and Google for training AI systems.

US Publishers Demand Common Crawl Stop Scraping

❓ Frequently Asked Questions

Related Articles

XERON to Showcase PC Hardware at Gamescom 2026

XBAND: Health Tracker That Pairs with Your Own Watch

AI's Most Valuable Skill: Data Labeling

5 Electric SUVs That Outrun the Mustang Mach-E