Two authors are suing OpenAI for training ChatGPT with their books. Could they win?
- Written by Dilan Thampapillai, Associate Professor, University of New South Wales, UNSW Sydney
Imagine you read a book. You commit details of the book to memory and ruminate on the ideas contained in it.
Somebody then asks you a question about the book. You provide them with a written response.
Would you be surprised if the author of the book tried to sue you for copyright infringement?
OpenAI is facing exactly this situation.
Authors Mona Awad (Bunny, 13 Ways of Looking at a Fat Girl) and Paul Tremblay (The Cabin at the End of the World), filed a lawsuit against OpenAI last week, claiming the books were used to train ChatGPT, its artificial intelligence software, without their consent.
It is the first lawsuit against ChatGPT that concerns copyright, The Guardian reported.
The only difference from the scenario I’ve outlined is that instead of a human reading a book, OpenAI is accused of allowing its AI program to copy a book to its internal database and train on it.
What’s the lawsuit’s chance of success?
OpenAI is a large language model (LLM). These LLMs train on data in the form of written works in order to provide natural language responses to prompts.
The basis of the lawsuit is that OpenAI trained itself on their novels and produced accurate summaries of their works when prompted.
Notably, the lawsuit does not specify which specific parts of Awad and Tremblay’s novels have been unlawfully copied and reproduced in the summaries.



















