Zuckerberg Approved Use of Pirated Books to Train AI, Suit Claims

Meta ceo Mark Zuckerberg himself knew that the books used to train the company’s AI tool were pirated, according to newly-public documents in one of the California class-action lawsuits against the tech company. Attorneys for a group of author plaintiffs (including Richard Kadrey, Sarah Silverman, Andrew Sean Greer, Ta-Nehisi Coates, and Jacqueline Woodson) assert that Zuckerberg approved the use of a dataset from shadow library LibGen to train their Llama LLM, despite concerns from other employees about it provenance. “Meta has treated the so-called ‘public availability’ of shadow datasets as a get out of jail free card, notwithstanding that internal […]