Meta's Llama 3 Trained on Pirated Data: Internal Documents Reveal Zuckerberg's Approval



2025-01-19

Newly unsealed documents reveal that Meta trained its Llama 3 large language model on copyrighted material from the pirate library Library Genesis (LibGen). Despite internal concerns, CEO Mark Zuckerberg approved the use of this data. The decision exposes Meta to potential copyright lawsuits and negative publicity, and it underscores broader concerns about the ethical sourcing of data in AI development.

(www.rollingstone.com)
