Loading...

Anthropic Hit With New Lawsuit From 100+ Authors Over Alleged Data Theft

News-style illustration showing a large folder labeled “Anthropic” stamped with “Lawsuit,” surrounded by books, manuscripts, and literary documents. A broken padlock marked “Alleged Data Theft” rests on an open book, symbolizing claims of unauthorized use of copyrighted works. On the right side, the headline reads “Anthropic Hit With New Lawsuit” with supporting text referencing more than 100 authors alleging their books were used to train AI models without permission or compensation. The design features a bold blue background with white, yellow, and red accents.

Anthropic Hit With New Lawsuit From 100+ Authors Over Alleged Data Theft

Artificial intelligence company Anthropic is facing another legal challenge over its data collection practices. More than 100 authors filed a copyright lawsuit against the AI developer on June 17, 2026. The complaint was filed in the US District Court for the Northern District of California. According to reporting by Times Now News, the authors accuse Anthropic of using their copyrighted books without permission to train its AI models. The group is led by Thomas William Shakespeare along with other writers who opted out of a previous settlement. This lawsuit differs from earlier cases by focusing on how Anthropic obtained the data rather than just how it was used. The legal action puts Anthropic back in the spotlight over long-standing debates about copyright and artificial intelligence.

Authors Claim Anthropic Used 500 Copyrighted Works Without License

According to the complaint, the authors allege that Anthropic used around 500 copyrighted works without obtaining any valid license. The legal filing names CEO Dario Amodei and co-founder Benjamin Mann as defendants in the case. The authors claim that Anthropic obtained millions of books from sources linked to piracy. These include datasets known as Pirate Library Mirror and Library Genesis. The complaint argues that Anthropic used this pirated material to build its Claude AI model. The authors maintain that the company has not compensated them for the use of their work. The central concern raised in the lawsuit revolves around Anthropic using copyrighted material for commercial AI development without proper authorization or payment to the original creators.

New Legal Strategy Targets Dataset Procurement Through BitTorrent

This lawsuit takes a different approach from previous copyright cases against AI companies. According to TechJackSolutions, the complaint targets how Anthropic obtained datasets through BitTorrent rather than challenging the model training itself. This distinction is legally significant because it creates a procurement-phase infringement theory. If successful, this argument would bypass the transformative use defense that AI companies often rely on in training-related cases. The lawsuit does not argue about whether training AI on copyrighted books is fair use. Instead, it focuses on the alleged illegal acquisition of those books through piracy channels. The legal theory shifts the battlefield from what Anthropic did with the data to how it acquired the data in the first place.

Previous Bartz Case Led to $1.5 Billion Anthropic Settlement

This is not the first time Anthropic has faced legal action over its data practices. The Bartz v. Anthropic case previously made headlines when authors claimed the company trained its Claude AI model using pirated books. In that case, the court ruled that training AI on legally obtained books could be considered fair use. However, the court found that building a library from pirated copies was a separate and problematic issue. Following the legal proceedings, the Bartz case eventually ended with Anthropic agreeing to pay a $1.5 billion settlement to the affected authors. The latest lawsuit involves a different group of authors who chose to opt out of that previous settlement. These authors are now pursuing their own legal action against the AI company.

Authors Seek Up to $71.4 Million in Statutory Damages

The group of authors is reportedly seeking maximum statutory damages of up to $71.4 million. This figure remains unconfirmed as the complete complaint text has not been verified by all reporting sources. The requested amount reflects the authors' claims that Anthropic used their copyrighted works without proper authorization. Statutory damages in copyright cases can range significantly depending on whether the infringement was willful. The lawsuit signals that the authors are seeking substantial financial compensation for the alleged unauthorized use of their books. The damages sought would be in addition to any other remedies the court might order if the plaintiffs prevail in their case against Anthropic.

Claude AI Training Data Sources Include LibGen and Pirate Library Mirror

The lawsuit specifically identifies several datasets that Anthropic allegedly used to train its Claude AI models. According to the complaint, the company obtained millions of books from sources linked to piracy, including Library Genesis (LibGen) and Pirate Library Mirror. These repositories are known in academic and tech circles for hosting large collections of copyrighted material without authorization. The authors argue that Anthropic knowingly used these pirated sources to build its training data. This differs from companies that license books from publishers or use openly available content. The complaint suggests that Anthropic was aware of the copyright status of these materials but proceeded to use them anyway to develop its commercial AI products.

Transformative Use Defense Becomes Legally Irrelevant Under This Theory

The legal strategy behind this lawsuit could have far-reaching implications for the AI industry. Under the procurement-phase infringement theory, fair use arguments built around transformative training become legally irrelevant for this class of claim. AI companies often defend themselves in copyright cases by arguing that training models on copyrighted works constitutes transformative use. This defense typically focuses on what the AI does with the data rather than how it got the data. By shifting the focus to the acquisition method, the authors are attempting to close a legal loophole. If this approach succeeds, it could make it much harder for AI companies to defend against copyright claims based on how they sourced their training data.

The Lawsuit Reflects Growing Author Resistance to AI Data Scraping

This legal action represents a broader trend of authors and creators pushing back against AI companies that use their work without permission. The creative community has grown increasingly concerned about how their intellectual property is being harvested for AI training. Many authors argue that AI models trained on their works without compensation undermine their livelihoods and devalue their creative output. The lawsuit against Anthropic is just one of several cases challenging AI companies' data practices. As the AI industry continues to expand, these legal battles will likely become more frequent and more consequential for how AI models are developed and deployed in the future.

The Northern District of California Has Become Key Venue for AI Cases

The lawsuit was filed in the US District Court for the Northern District of California, which has become a central venue for technology-related legal disputes. This district covers the San Francisco Bay Area, home to many of the world's largest AI companies, including Anthropic. The court has already handled several significant AI copyright cases, including the Bartz v. Anthropic matter that resulted in the $1.5 billion settlement. The concentration of AI litigation in this district means that judges are developing expertise in these complex legal issues. The outcome of this latest case could set important precedents for how copyright law applies to AI development in the United States.

Anthropic's Legal Troubles Could Affect Its Record Valuation

The timing of this lawsuit is significant given Anthropic's recent financial success. The company recently surpassed OpenAI as the world's most valuable AI startup with a $965 billion valuation. This legal challenge could introduce uncertainty at a critical moment in Anthropic's growth trajectory. Investors and enterprise customers may watch the case closely to assess potential legal and financial risks. However, the company has shown resilience in facing previous legal challenges. The Bartz case settlement, while substantial, did not prevent Anthropic from achieving its record valuation. The outcome of this new lawsuit will depend on how the court interprets the procurement-phase infringement theory.

The Lawsuit Could Shape How AI Companies Build Training Datasets

Regardless of the outcome, this lawsuit is already influencing discussions about how AI companies should source their training data. The case highlights the risks of using pirated materials to build commercial AI products. Many AI companies have been pressured to improve their data practices and establish clearer provenance for their training datasets. Some have begun licensing content from publishers, while others are developing tools to ensure their training data has appropriate authorization. The legal scrutiny around data acquisition methods may accelerate these efforts across the industry. This case specifically addresses an issue that courts have only begun to examine in detail.

The Authors' Group Includes Writers Who Opted Out of Previous Settlement

The group of authors in this new lawsuit is particularly notable because they specifically opted out of the earlier Bartz settlement. Opting out allowed these writers to pursue their own legal action against Anthropic. This suggests they were not satisfied with the terms of the $1.5 billion settlement negotiated in the previous case. The authors are led by Thomas William Shakespeare, who is serving as the named plaintiff in the complaint. Their decision to pursue separate litigation indicates that some authors believe they can achieve better outcomes through their own legal strategy. This approach allows them to target the data procurement process directly rather than accepting the terms of a broader class action settlement.

The Docket Number Remains Unconfirmed as Case Proceeds

As of the latest reports, the docket number for this new lawsuit has not been confirmed. The lack of a publicly available docket number makes it difficult to access the full complaint text and verify all allegations. Some details reported in the media have not been independently confirmed against the actual court filing. However, multiple news outlets have reported on the lawsuit's existence and its key allegations. The case was filed on June 17, 2026, and will proceed through the Northern District of California court system. As the case develops and documents become publicly available, more details about the specific claims and evidence will emerge.

This Case Differs From the Ongoing Claude Usage Limits Lawsuit

It is important to distinguish this copyright case from other ongoing legal actions against Anthropic. A separate lawsuit has been filed by a Claude AI subscriber who alleges misleading marketing about usage limits on premium plans. That subscriber claims the actual usage was significantly lower than what Anthropic promised in its marketing materials. The subscriber is seeking class-action status to represent others affected since April 2025. While both cases involve Anthropic, they address completely different legal issues. The copyright lawsuit focuses on data acquisition and intellectual property, while the subscriber lawsuit deals with consumer protection and marketing practices.

Source & AI Information: External links in this article are provided for informational reference to authoritative sources. This content was drafted with the assistance of Artificial Intelligence tools to ensure comprehensive coverage, and subsequently reviewed by a human editor prior to publication.

About the Author & Admin ✍️

Ai Tester/Evaluator • Blogger • Domain Investor/Analyst • Web Developer • Digital Content Creator • News Editor/Publisher • 37+ Years of Experience in the Fields of Technology, Sociology & Digital Activities

Post a Comment

0 Comments