Authors Grady Hendrix and Jennifer Roberson have filed a class action lawsuit in the Northern District of California against Apple for copyright infringement using their books to train its LLM. The lawsuit asserts that Apple used the pirated dataset Books3 to train its language models, and that the company’s Applebot software scraped pirate sites to obtain copyrighted books. It also notes that Apple entered a licensing deal with Shutterstock to train its genAI tools, but not with authors. “Apple did not compensate creators for use of their copyrighted works and concealed the sources of their training datasets to evade legal […]