九一星空无限

ZB ZB
Opinion
Live now
Start time
Playing for
End time
Listen live
Up next
ZB

Kiwi authors caught in US$1.5b AI book piracy payout

Author
Bill Hickman - RNZ ,
Publish Date
Wed, 3 Dec 2025, 1:38pm
Catherine Chidgey at the Ockham New Zealand Book Awards, 2023. Photo / Marcel Tromp
Catherine Chidgey at the Ockham New Zealand Book Awards, 2023. Photo / Marcel Tromp

Kiwi authors caught in US$1.5b AI book piracy payout

Author
Bill Hickman - RNZ ,
Publish Date
Wed, 3 Dec 2025, 1:38pm

By Bill Hickman of RNZ 

A Kiwi author included in a US billion-dollar settlement over the illegal downloading of books to build an AI chatbot says she hopes the case will be a warning to the AI industry. 

But the Society of Authors said the authors of millions of titles used to build the tools will miss out on compensation as they are not registered for copyright in American territories. 

Anthropic AI has agreed to pay out up to US$1.5 billion ($2.6b) to settle claims it used millions of pirated books to train its large language models, in a class action in Californian courts. 

Award-winning author Catherine Chidgey said she received an email saying her books Remote Sympathy, The Wishchild and The Transformation had been identified as being caught up in the case. 

She said authors were being offered a payment of US$3000 ($5240) for each title accessed by the company. 

鈥淥n the one hand, I鈥檓 grateful that they鈥檙e being held to account. The works they accessed were from two websites that held pirated works so they weren鈥檛 accessing them legally. It鈥檚 not like they went out and paid for them. 

鈥淚 imagine this will serve as a warning to others out there who are hoovering up intellectual property without asking. This has been going on for a while now; it鈥檚 just that [Anthropic] are the first company that鈥檚 been held to account for it.鈥 

Anthropic AI has agreed to pay out up to $2.6 billion to settle claims it used millions of pirated books to train its large language models, in a class action in Californian courts. Photo / Getty Images

Anthropic AI has agreed to pay out up to $2.6 billion to settle claims it used millions of pirated books to train its large language models, in a class action in Californian courts. Photo / Getty Images 

She said the money offered to authors paled in comparison to the toil behind creating the works. 

鈥淚t鈥檚 not really enough 鈥 if you think about the years of effort I鈥檝e put into writing those books 鈥 but I鈥檓 glad that there has been a line drawn in the sand.鈥 

鈥楢 slap on the wrist鈥 

RNZ AI commentator Peter Griffin said the ruling did not counter the use of intellectual property for training AI models without permission. 

He said instead the settlement had pivoted on Anthropic鈥檚 use of two book pirating websites (Library Genesis and Pirate Library Mirror) to source the content. 

鈥淭his boils down to basically a fine for using dodgy websites and - in the scheme of things 鈥 $1.5b is not a lot for a company like Anthropic. That鈥檚 a slap on the wrist with a wet bus ticket. 

鈥淔undamentally, the things that [the AI companies] were terrified about was a ruling saying you cannot use this material under copyright law 鈥 this is not 鈥榝air use鈥. That what鈥檚 really scared them and that did not happen. 

鈥淯ltimately, it says to the AI companies, 鈥榝ill your boots, you can still use all of these texts to inform AI training models but if you鈥檙e ever caught doing that through dodgy pirate websites 鈥 or peer to peer file sharing networks 鈥 you鈥檙e going to face repercussions鈥.鈥 

He said companies recognised the need to move quickly to distinguish themselves in the rapidly evolving field. 

鈥淢any of these companies take the view 鈥 in the famous words of Mark Zuckerberg 鈥 鈥榤ove fast and break things鈥. 

鈥淭hey needed to get a model up quickly that would wow the world 鈥 that would be really useful 鈥 so they just hoovered up anything that they could get their hands on. Now there鈥檚 this sort of rearguard action by content owners and publishers to try and scramble to retain some of that value.鈥 

However, Griffin said the need for AI tools to be continuously trained on new data could still 鈥渄raw a line in the sand鈥 for AI use of intellectual property in the future. 

鈥淚t鈥檚 going to be a very difficult process and I don鈥檛 think publishers and authors will get anywhere near what they deserve to get but at least now there is a precedent that 鈥榮ure, it may be fair use but you have to legitimately obtain those copyrighted texts鈥. 

鈥淭he only way they can really do that at scale is to strike a deal with the publishers so they can really get that done in a legitimate way.鈥 

Thousand of authors ineligible for compensation 

New Zealand Society of Authors chief executive Jenny Nagle says while thousands of New Zealand titles may have been used by the company it was likely only dozens of local authors would be eligible for compensation. 

鈥淚 have records of about 3500 New Zealand titles that formed part of that data set that was uploaded. However, the settlement in the US courts decreed that only books registered with the US copyright office were eligible to be part of the settlement. This was appealed by lawyers from the Society of Authors from around the world because books from all countries, all copyright jurisdictions and languages, are part of this data set [but] it was denied by the US courts. 

鈥淢y understanding is there was about 7.5 million books in this pirated library and about 1.5 million will be eligible for compensation through this settlement.鈥 

Nagle said the society was encouraging members to ensure their books were registered through the US copyright office for future claims. 

鈥淭his is but the first settlement of scores of court cases that are in train for this issue. Nearly all of the larger language models have been trained by pirated libraries. They鈥檝e been trained by copyright theft. 

鈥淚f you are a producer of a product generally you pay for the ingredients of your product and in this case the AI developers have said 鈥榝air use, we can scrape whatever we want and we don鈥檛 have to pay鈥 but this is people鈥檚 property.鈥 

Nagle said the Government needed to act quickly but the current review of the Copyright Act was unlikely to address issues around AI until its second stage in 2027. 

鈥淚t is more urgent than that. The Australian Government has just moved to say that data mining and scraping ingestion is not a fair use 鈥 or fair dealing copyright - and needs licensing and we would really look to the Government to do something about this. 

鈥淭his is a rapidly evolving space, of course, but really we do need legislation and regulation because the development is just the wild west at the moment.鈥 

Author Rose Carlyle, whose works The Girl in the Mirror and No One Will Know were published in the US, is included in the class action and due for a payment. 

鈥淭hey took from us what we slaved over for years,鈥 she told Morning Report. 鈥淚t really guts me that that has just been taken and used to make so much money by billionaire companies. They could have bought the book but they chose not to.鈥 

- RNZ 

Take your Radio, Podcasts and Music with you