
A $10 billion+ lawsuit has been filed by nine
newspapers, accusing OpenAI and Microsoft of violating their copyrights for the purpose of training large language models.
The case, one in a long line of such actions, was filed
Wednesday by California Newspapers Partnership, Prairie Mountain Publishing Company LLP; MNG-BH Acquisition LLC; Hartford Courant Company, LLC; The Daily Press LLC; The Morning Call, LLC;
Virginia-Pilot Media, LLC; Los Angeles Daily News Publishing Companies; and the San Diego Union-Tribune, LLC.
The complaint argues that, “unlike a traditional search result, the
synthetic output does not include a prominent hyperlink that sends users to the Publishers’ website. Rather, the output disguises the results as the work of the GPT system itself.”
It continues, “There is no question that the Defendants’ models have ‘memorized’ the pilfered copies of the authors’ and publishers’ copyrighted works. And in
order to remain current, the Defendants cannot rely just on the content they stole in the past – they have to update their models regularly with new material so they can provide their users with
the latest information.”
advertisement
advertisement
The complaint adds, “Hundreds of thousands of the Publishers’ Works were copied and ingested – multiple times – for the purpose
of 'training' Defendants’ GPT models."
In addition, the complaint notes that “the publishers of nine regional newspapers join the long list of publishers and authors who have
filed lawsuits against OpenAI, Microsoft, and other AI companies. Most of these lawsuits have been consolidated in this Court, and have survived motions to dismiss largely intact.”
The case is on file with the U.S. District Court for the Southern District of New York.