Meta's LLaMA and the Copyright Tsunami: A Pirate Bay for AI?

2025-02-11
Meta's LLaMA and the Copyright Tsunami: A Pirate Bay for AI?

Authors are suing various Large Language Model (LLM) vendors, claiming copyright infringement in the training data. The evidence points to Meta's LLaMA, which used Books3 from Bibliotik – a private tracker containing massive amounts of pirated books. Meta's own paper admits to using Books3, essentially confessing to training on unauthorized intellectual property. This sparks debate on AI fair use and copyright, but the core issue remains: should an AI openly admitting to using pirated data face legal consequences?

AI