Reddit To Block Wayback Machine In Effort To Prevent Content Scraping


Reddit is restricting access to its content by the Internet Archive’s Wayback Machine.

The Wayback Machine will now be allowed to index the Reddit home page, but not the majority of Reddit content, the Verge reports.

Reddit came to this decision after determining that it was being victimized by content scrapers supported by artificial intelligence. 

”Internet Archive provides a service to the open web, but we’ve been made aware of instances where AI companies violate platform policies, including ours, and scrape data from the Wayback Machine,” Reddit spokesperson Tim Rathschmidt told The Verge.

advertisement

advertisement

Rathschmidt continued, “Until they’re able to defend their site and comply with platform policies (e.g., respecting user privacy, re: deleting removed content) we’re limiting some of their access to Reddit data to protect redditors.”

Reddit has signed a content deal with OpenAI, but has sued Anthropic for alleged scraping, The Verge adds. 

 

 

Next story loading loading..