Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal
www.wired.com
One of the most important AI copyright legal battles just took a major turn.

28 comments
The notorious piracy database in question is Library Genesis.
Cached article:
Earlier reports suggested they trained it on books from Bibliotik.
What changed?
Probably both, honestly.
In for a penny, in for a pound.
The LLaMA-1 paper acknowledged the use of the books dataset; LibGen isn't mentioned in any of the papers, so this is new info.